Preface


Dedekind and Weber’s 1882 paper on algebraic functions of one variable is one 
of the most important papers in the history of algebraic geometry. It changed the 
direction of the subject, and established its foundations, by introducing methods 
from algebraic number theory. Specifically, they used rings and ideals to give rig- 
orous proofs of results previously obtained, in nonrigorous fashion, with the help 
of analysis and topology. Also, by importing ideas from number theory, the paper 
revealed the deep analogy between number fields and function fields—an analogy 
that continues to benefit both number theory and geometry today. 

The influence of the paper is obvious in 20th-century algebraic geometry, where 
the role of arithmetic/algebraic methods has increased enormously in both scope 
and sophistication. But, as the sophistication of algebraic geometry has increased, 
so has its detachment from its origins. While the Dedekind-Weber paper continues 
to be cited, I venture to guess that few modern algebraic geometers are familiar 
with its contents. There are a few useful commentaries on the paper, but those 
that I know seem to focus on a few of the concepts used by Dedekind and Weber, 
while ignoring others. And, of course, fewer mathematicians today are able to read 
the language in which the paper was written (and I don’t mean only the German 
language, but also the mathematical language of the 1880s). 

I therefore believe that it is time for an English edition of the paper, with 
commentary to assist the modern reader. My commentary takes the form of a 
Translator’s Introduction, which lays out the historical background to Dedekind 
and Weber’s work, plus section-by-section comments and footnotes inserted in the 
translation itself. The comments attempt to guide the reader through the original 
text, which is somewhat terse and unmotivated, and the footnotes address specific 
details such as nonstandard terminology. The historical background is far richer 
than could be guessed from the Dedekind-Weber paper itself, including such things
as Abel’s results in integral calculus, Riemann’s revolutionary approach to complex 
analysis and his discoveries in surface topology, and developments in number theory 
from Euler to Dedekind. The background is indeed richer than some readers may 
care to digest, but it is a background against which the clarity and simplicity of 
the Dedekind-Weber theory looks all the more impressive. 

I hope that this edition will be of interest to several classes of readers: historians 
of mathematics who seek an annotated edition of one of the classics, mathematicians 
interested in history who would like to know where modern algebraic geometry came 
from, students of algebraic geometry who seek motivation for the concepts they are 
studying, and perhaps even algebraic geometers who have not had time to catch up 
with the origins of their discipline. (It seems to an outsider that just the modern 
literature on algebraic geometry would take more than a lifetime to absorb.) 


This translation was originally written in the 1990s, but in 2011 I was motivated 
to revise it and write an introduction in order to prepare for a summer school 
presentation on ideal elements in mathematics. I have also compiled a bibliography 
and index. The bibliography is mainly for the Translator’s Introduction, but it is 
occasionally referred to in the commentary on the translation, so I have placed it 
after the translation. 

The summer school, PhilMath Intersem, was organized by Mic Detlefsen, and 
held in Paris and Nancy in June 2011. I thank Mic for inviting me and for support 
during the summer school. I also thank Monash University and the University of 
San Francisco for their support while I was researching this topic and writing it up. 
Anonymous reviewers from the AMS have been very helpful with some technical 
details of the translation, and I also thank Natalya Pluzhnikov for copyediting. 
Finally, I thank my colleague Tristan Needham, my wife Elaine, and son Robert 
for reading the manuscript and saving me from some embarrassing errors. 


John Stillwell 
South Melbourne, 1 May 2012 


https://doi.org/10.1090/hmath/039/01 


Translator’s Introduction 


1. Overview 


Modern algebraic geometry has deservedly been considered for a long 
time as an exceedingly complex part of mathematics, drawing prac- 
tically on every part to build up its concepts and methods and in- 
creasingly becoming an indispensable tool in many seemingly remote 
theories. It shares with number theory the distinction of having one 
of the longest and most intricate histories among all branches of our 
science, of having always attracted the efforts of the best mathemati- 
cians in each generation, and of still being one of the most active areas 
of research. 

Dieudonné (1972), p. 827. 


It seems to me that, in the spirit of the biogenetic law, the student who 
repeats in miniature the evolution of algebraic geometry will grasp the 
logic of the subject more clearly. 

Shafarevich (1994), p. vii. 


Richard Dedekind and Heinrich Weber first worked together in 1874, as co- 
editors of Riemann’s collected works. Weber was called into this project as a 
replacement for Clebsch, who had died unexpectedly of diphtheria, and his expertise
in mathematical physics complemented Dedekind’s expertise in pure algebra and 
analysis. The fruit of this collaboration was their joint paper, Dedekind and Weber 
(1882), a ground-breaking contribution to the understanding and advancement of 
Riemann’s ideas. Theorie der algebraischen Functionen einer Veränderlichen
(theory of algebraic functions of one variable) revolutionized algebraic geometry 
by introducing methods of algebraic number theory into the subject. This made 
possible the first rigorous proofs of theorems discovered with the help of physical 
intuition, and opened the way to an extension of algebraic-geometric concepts from 
the complex numbers to arbitrary fields. 

In a sense, the paper is a sequel to Dedekind (1877), a long paper in which 
Dedekind expounded his theory of ideals and their applications to number theory. 
However, Dedekind and Weber give a self-contained exposition of their theory, 
which is at some points simpler than the ideal theory for algebraic numbers. 

Like Dedekind (1877), the Dedekind-Weber paper starts with the concept of 
field, but this time it is a field of functions, the “algebraic functions of one variable.” 
Following the example of number theory, they distinguish the ring of integers of this 
field, then the primes, and finally the ideals. As in number theory, it turns out that 
ideals are crucial to complete the analogy with the traditional arithmetic of integers. 
However, in the context of algebraic functions, ideals prove to be important in other 
ways, and indeed a more general idea that they call “polygons” is needed. 
To show the value of these new ideas, Dedekind and Weber gave new proofs of
two great theorems: Abel’s theorem of Abel (1841) and the Riemann-Roch theorem 
of Riemann (1857) and Roch (1864). These theorems are as timely today as they 
were in 1882, but they require some introduction, which Dedekind and Weber do 
not supply. I would therefore like to present some historical background to these 
theorems, and to the theory of algebraic functions itself, with copious examples. In 
some ways, this introduction is a sequel to my introduction to Dedekind (1877), 
though I will recapitulate some points to keep it self-contained. 

In preparing this material I have been greatly assisted by the first 35 pages of 
Dieudonné (1985), an extraordinarily rich and insightful account of the development 
of algebraic geometry up to the Dedekind-Weber paper, and Koch (1991), which 
places this development against the general background of 19th-century mathemat- 
ics. Another helpful overview is the chapter by Geyer in Dedekind et al. (1981). 
As will become apparent, much of the algebra in modern algebraic geometry arose 
from problems in classical analysis, particularly the integral calculus. The first such 
result was the fundamental theorem of algebra, originally motivated by the desire 
to factorize polynomials for the purpose of integrating rational functions. 


2. From Calculus to Abel’s Theory of Algebraic Curves 


What a discovery is Abel’s generalization of Euler’s integral! I have 
never seen such a thing! But how can it be that this discovery, which 
could be the most important made in the mathematics of this century, 
and which was communicated to your Academy two years ago, has 
escaped the attention of you and your colleagues? 

Jacobi (1829) letter to Legendre, 14 March 1829. 


When calculus was developed in the 17th century, the first really hard problems 
were problems of integration. This was especially true of the Leibniz approach, 
which sought integrals in “closed form,” that is, in terms of functions from the 
small class known as “elementary.” These are the algebraic functions, together 
with functions arising from them by composition with the exponential function and 
its relatives, the logarithm, circular functions, and inverse circular functions. 

The only broad class of functions that can be integrated in Leibniz’s sense
are the rational functions, that is, the functions of the form $r(x) = p(x)/q(x)$,
where $p$ and $q$ are polynomials. Any rational function can be integrated because
the denominator $q(x)$ may be split into linear factors $(x - a)$, by the fundamental
theorem of algebra, and the quotient $p(x)/q(x)$ may then be decomposed into partial
fractions of the form $x^m/(x - a)^n$, which have rational integrals in all cases except

\[ \int \frac{dx}{x-a} = \log(x - a) + \text{constant}. \]

Thus the integral of a rational function is itself a rational function, with the possible
exception of some terms of the form $\log(x - a)$.

(In elementary calculus courses this simple picture is confused by the presence
of partial fractions such as $1/(x^2 + 1)$, the integral of which is usually taken to be
$\tan^{-1} x + \text{constant}$. However, we have

\[ \frac{1}{x^2+1} = \frac{1/2i}{x-i} - \frac{1/2i}{x+i}, \]

so we can also express $\int dx/(x^2+1)$ as a sum of logarithms, namely

\[ \int \frac{dx}{x^2+1} = \frac{1}{2i}\int \frac{dx}{x-i} - \frac{1}{2i}\int \frac{dx}{x+i} = \frac{1}{2i}\log(x-i) - \frac{1}{2i}\log(x+i) + \text{constant}. \]

This was first done, albeit with some confusion about the meaning of complex loga- 
rithms, by Johann Bernoulli (1702). Around 1800, when the fundamental theorem 
of algebra was finally proved, the meaning of complex numbers became better un- 
derstood, and it became increasingly clear that they play an important role in the 
theory of integrals.) 
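The agreement between the inverse tangent and the sum of complex logarithms can be checked numerically. The following is only a sketch, assuming principal values of the complex logarithm as computed by Python's `cmath.log`; the function names are illustrative and not taken from any source. For real $x$ the two expressions should differ by a constant, which with principal values is $-\pi/2$.

```python
import cmath
import math


def log_form(x: float) -> complex:
    """(1/2i) log(x - i) - (1/2i) log(x + i), with principal logarithms."""
    return (cmath.log(x - 1j) - cmath.log(x + 1j)) / 2j


def offset_from_arctan(x: float) -> complex:
    """Difference between the logarithmic form and the usual inverse tangent."""
    return log_form(x) - math.atan(x)


if __name__ == "__main__":
    # The printed offsets should all be the same constant (here -pi/2).
    for x in (-3.0, -0.5, 0.0, 1.0, 10.0):
        print(x, offset_from_arctan(x))
```

Because the difference is the same for every real $x$, the two antiderivatives agree up to the constant of integration.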

When the rational functions are extended by as little as the square root func- 
tion, the resulting integrals quickly fall outside the class of elementary functions. 
A famous example is the lemniscatic integral 


\[ \int_0^x \frac{dt}{\sqrt{1-t^4}}, \]
so-called because it expresses the arc length of the lemniscate of Jakob Bernoulli 
(1694), shown in Figure 1. 


FIGURE 1. The lemniscate of Bernoulli 


This curve has cartesian equation $(x^2 + y^2)^2 = x^2 - y^2$, and its arc length cannot
be expressed in terms of elementary functions of $x$ and $y$. However, Fagnano (1718)
discovered that the lemniscatic integral satisfies an arc-length doubling formula

\[ 2\int_0^x \frac{dt}{\sqrt{1-t^4}} = \int_0^{2x\sqrt{1-x^4}/(1+x^4)} \frac{dt}{\sqrt{1-t^4}}, \]

and Euler (1768) generalized Fagnano’s formula to an arc-length addition formula

\[ (*) \qquad \int_0^x \frac{dt}{\sqrt{1-t^4}} + \int_0^y \frac{dt}{\sqrt{1-t^4}} = \int_0^{(x\sqrt{1-y^4}+y\sqrt{1-x^4})/(1+x^2y^2)} \frac{dt}{\sqrt{1-t^4}}. \]
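Both formulas for the lemniscatic integral can be tested numerically for small arguments. The sketch below is a sanity check rather than a proof: it approximates the integral by Simpson's rule, and the sample values 0.3 and 0.4 and all function names are assumptions chosen for illustration. The values are kept small so that the combined arc stays below a quarter of the lemniscate.

```python
import math


def lemniscatic_integral(x: float, n: int = 2000) -> float:
    """Approximate the integral of dt/sqrt(1 - t^4) from 0 to x (Simpson's rule, n even)."""
    def f(t: float) -> float:
        return 1.0 / math.sqrt(1.0 - t ** 4)

    h = x / n
    total = f(0.0) + f(x)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(k * h)
    return total * h / 3.0


def doubled_limit(x: float) -> float:
    """Upper limit in Fagnano's doubling formula."""
    return 2 * x * math.sqrt(1 - x ** 4) / (1 + x ** 4)


def added_limit(x: float, y: float) -> float:
    """Upper limit in Euler's addition formula."""
    return (x * math.sqrt(1 - y ** 4) + y * math.sqrt(1 - x ** 4)) / (1 + x ** 2 * y ** 2)


if __name__ == "__main__":
    x, y = 0.3, 0.4
    # Each pair of printed numbers should agree to many decimal places.
    print(2 * lemniscatic_integral(x), lemniscatic_integral(doubled_limit(x)))
    print(lemniscatic_integral(x) + lemniscatic_integral(y),
          lemniscatic_integral(added_limit(x, y)))
```

Setting $y = x$ in the addition limit recovers the doubling limit, which is the sense in which Euler's formula generalizes Fagnano's.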
These results are analogous to properties of the inverse sine integral 


\[ \theta = \sin^{-1} x = \int_0^x \frac{dt}{\sqrt{1-t^2}}, \]

which are derivable from basic properties of the sine and cosine functions. For 
example, the familiar angle-doubling formula 


\[ \sin 2\theta = 2\sin\theta\cos\theta = 2\sin\theta\sqrt{1-\sin^2\theta}, \]

implies that

\[ 2\theta = \sin^{-1}\left(2\sin\theta\sqrt{1-\sin^2\theta}\right), \]

which gives the angle-doubling formula for integrals:

\[ 2\sin^{-1} x = 2\int_0^x \frac{dt}{\sqrt{1-t^2}} = \int_0^{2x\sqrt{1-x^2}} \frac{dt}{\sqrt{1-t^2}}. \]

And the familiar angle addition formula,

\[ \sin(\theta+\varphi) = \sin\theta\cos\varphi + \cos\theta\sin\varphi = \sin\theta\sqrt{1-\sin^2\varphi} + \sin\varphi\sqrt{1-\sin^2\theta}, \]

implies

\[ \theta+\varphi = \sin^{-1}\left(\sin\theta\sqrt{1-\sin^2\varphi} + \sin\varphi\sqrt{1-\sin^2\theta}\right), \]

which gives the addition formula for arcsine integrals:

\[ \int_0^x \frac{dt}{\sqrt{1-t^2}} + \int_0^y \frac{dt}{\sqrt{1-t^2}} = \int_0^{x\sqrt{1-y^2}+y\sqrt{1-x^2}} \frac{dt}{\sqrt{1-t^2}}. \]

Thus in both cases we find that a sum of two integrals, $\int_0^x f(t)\,dt + \int_0^y f(t)\,dt$,
can be simplified to a single integral, $\int_0^z f(t)\,dt$, where $z$ is an algebraic function of
$x$ and $y$.

It so happens that the integrand $1/\sqrt{1 - t^2}$ of the inverse sine integral can be
rationalized by the change of variable $t = 2s/(1+s^2)$ (not so surprisingly, since the
inverse sine is an elementary function), so we can eliminate the integral altogether
in this case. However, in the case of the lemniscatic integral, reducing the sum of
two integrals to one is the best we can do. The integrand $1/\sqrt{1 - t^4}$ cannot be
rationalized by a change of variable, and indeed Jakob Bernoulli (1704) made a
remarkable attempt to prove this, using the theorem of Fermat that the equation
$X^4 - Y^4 = Z^2$ has no solution in positive integers $X, Y, Z$. His attempt fell short,
because it is not enough to know this theorem for integers $X, Y, Z$. But it is enough
to know it for polynomials $X(t), Y(t), Z(t)$, and indeed polynomials behave enough
like integers that Fermat’s proof can be replayed for polynomials, though no one
noticed this in Bernoulli’s time.
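To see concretely what "rationalized" means for the inverse sine integrand, one can check with exact rational arithmetic that the substitution $t = 2s/(1+s^2)$ turns $\sqrt{1-t^2}$ into the rational function $(1-s^2)/(1+s^2)$, so that $dt/\sqrt{1-t^2}$ becomes $2\,ds/(1+s^2)$. The sketch below assumes $0 \le s < 1$; the function names are illustrative only.

```python
from fractions import Fraction


def substitution(s: Fraction):
    """Return (t, root, dt_ds) for t = 2s/(1+s^2), assuming 0 <= s < 1."""
    t = 2 * s / (1 + s * s)
    root = (1 - s * s) / (1 + s * s)            # rational value of sqrt(1 - t^2)
    dt_ds = 2 * (1 - s * s) / (1 + s * s) ** 2  # derivative of t with respect to s
    return t, root, dt_ds


def transformed_integrand(s: Fraction) -> Fraction:
    """(1/sqrt(1 - t^2)) * dt/ds, which should equal the rational function 2/(1+s^2)."""
    _, root, dt_ds = substitution(s)
    return dt_ds / root


if __name__ == "__main__":
    s = Fraction(1, 3)
    t, root, _ = substitution(s)
    print(t, root, root ** 2 == 1 - t ** 2)            # exact check of the square root
    print(transformed_integrand(s) == 2 / (1 + s * s))  # exact check of the new integrand
```

No such rational substitution exists for $1/\sqrt{1-t^4}$, which is the point of the contrast drawn above.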

Thus there is an essential difference between the ordinary sine function and 
the lemniscatic sine function, sl, defined as the inverse of the lemniscatic integral. 
Nevertheless there are enough similarities to enable the development of a theory of 
the lemniscatic sine function. This was begun by Gauss in 1796, and extended to 
a general theory of the so-called elliptic functions by Abel and Jacobi in the 1820s. 
Like the circular functions, the elliptic functions satisfy addition formulas and they 
are periodic, only more so. Just as the sine and cosine have period $2\pi$, in the sense
that

\[ \sin(\theta + 2\pi) = \sin\theta, \qquad \cos(\theta + 2\pi) = \cos\theta, \]

an elliptic function $f$ has two periods $\omega_1, \omega_2$, in the sense that

\[ f(z + \omega_1) = f(z), \qquad f(z + \omega_2) = f(z). \]
The periods $\omega_1, \omega_2$ are complex numbers whose ratio is not real. For example,
Gauss discovered that the two periods of sl are $\varpi$ and $i\varpi$, where

\[ \varpi = 4\int_0^1 \frac{dt}{\sqrt{1-t^4}}. \]

The double periodicity of elliptic functions was first explained by algebraic manipulation
of integrals, but Riemann (1851) found a far more transparent geometric
explanation (not unlike explaining the period $2\pi$ of sine and cosine by referring to
the circle), which we will come to later.

The theory of elliptic functions was the first great advance in integral calculus 
since the integration of rational functions. Nevertheless, this theory only scratched 
the surface of a huge and important world of calculus: the integrals of algebraic 
functions; that is, integrals of the form 


\[ \int g(s,t)\,dt, \]

where $g$ is a rational function and $s$ satisfies a polynomial equation $P(s,t) = 0$.

The lemniscatic integral is $\int dt/s$, where $s^2 = 1 - t^4$, and the general theory of
elliptic functions deals with the integrals $\int dt/s$ where $s^2$ equals a polynomial of
degree 3 or 4 in $t$. But what can one say, for example, about the integral

\[ \int_0^x \frac{dt}{\sqrt{1-t^6}}\,? \]

It turns out that this integral does not satisfy an addition formula

\[ \int_0^x \frac{dt}{\sqrt{1-t^6}} + \int_0^y \frac{dt}{\sqrt{1-t^6}} = \int_0^z \frac{dt}{\sqrt{1-t^6}} \]

where $z$ is an algebraic function of $x$ and $y$. However, Abel (1841) discovered a
wonderful substitute for an addition formula: any sum of integrals,

\[ \int_0^{x_1} \frac{dt}{\sqrt{1-t^6}} + \cdots + \int_0^{x_m} \frac{dt}{\sqrt{1-t^6}}, \]

is equal to the sum of two integrals

\[ \int_0^{z_1} \frac{dt}{\sqrt{1-t^6}} + \int_0^{z_2} \frac{dt}{\sqrt{1-t^6}}, \quad \text{where $z_1, z_2$ are algebraic functions of $x_1,\ldots,x_m$,} \]

plus some “trivial” algebraic and logarithmic terms.
This result is only an illustration of the amazingly general: 


Abel’s Theorem. For any integral of the form $\int g(s,t)\,dt$, where $g$ is a rational
function and $s$ and $t$ are connected by a polynomial relation $P(s,t) = 0$, there is a
number $p$ such that any sum of integrals

\[ \int_0^{x_1} g(s,t)\,dt + \cdots + \int_0^{x_m} g(s,t)\,dt \]

equals a sum of at most $p$ integrals

\[ \int_0^{z_1} g(s,t)\,dt + \cdots + \int_0^{z_p} g(s,t)\,dt, \]

where $z_1,\ldots,z_p$ are algebraic functions of $x_1,\ldots,x_m$, plus terms that are either
rational functions or their logarithms.


The number p depends only on the polynomial P. It was later called the 
genus of the curve defined by P(s,t) = 0, and it too found a natural geometric 
interpretation in Riemann (1851), as we will see in the next section. In particular,
the curve $s^2 = 1 - t^4$ that yields the lemniscatic integral has genus 1, because any
sum of lemniscatic integrals reduces to one integral by repeated application of the
addition formula (*). More generally, any elliptic curve¹ $s^2 = q(t)$, where $q(t)$ is
of degree 3 or 4 without repeated roots, is of genus 1, because there is an addition
formula for the corresponding integral $\int dt/s$.

Finally, any curve $s = P(w)$, $t = Q(w)$ parameterized by rational functions $P$
and $Q$ is of genus zero, because the corresponding integral $\int g(s,t)\,dt$ is the integral
of the rational function $g(P(w), Q(w))\,Q'(w)$.


Abel submitted his paper to Cauchy in 1826 but, due to inattention by the 
mathematicians of the Paris Academy, it was not published at the time. It was 
noticed by Jacobi, however, who in 1829 wrote the letter to Legendre quoted at 
the beginning of this section. Even the intervention of Jacobi failed to wake up the 
Academicians, and Abel’s paper did not appear until 1841, long after Abel had died. 
There is a further excruciating twist to this story of neglected genius. The other 
mathematician notoriously ignored by the Paris Academy, Évariste Galois, also
seems to have discovered Abel’s theorem, independently of Abel, but some years 
later. It is mentioned in his letter to Auguste Chevalier, Galois (1846), written 
on the night before his death in 1832. He states the theorem without proof, but 
with some additional remarks that suggest that he already had some of the ideas 
developed by Riemann 20 years later. 


3. Riemann’s Theory of Algebraic Curves 


It is quite a paradox that in the work of this prodigious genius, out 
of which algebraic geometry emerges entirely regenerated, there is 
almost no mention of algebraic curve; it is from his theory of algebraic 
functions and their integrals that all of the birational geometry of the 
nineteenth and the beginning of the twentieth century issues. 
Dieudonné (1985), p. 18. 


In the 1850s, two papers by Bernhard Riemann² completely changed the face
of complex analysis and algebraic geometry. Riemann (1851) and Riemann (1857) 
viewed algebraic curves in a new way, as what we now call Riemann surfaces. In 
retrospect, this development seems unsurprising and even inevitable. Since around 
1800, mathematicians had become used to the idea that the complex “line” $\mathbb{C}$ was
geometrically a plane, so the idea that a complex “curve” should be some kind 
of surface was just over the horizon. Nevertheless, Riemann’s description of these 
surfaces was received skeptically by most of his contemporaries. The underlying 
topological ideas, though very intuitive and persuasive, did not yet have a rigorous 
foundation. And, to make matters worse, Riemann made connections between 
topology and analysis by appealing to physics. Then, as now, this was considered 
mathematically dubious. 


¹The name “elliptic” became attached to the curves of genus 1 because the corresponding
integrals (“elliptic integrals”) include the integral for the arc length of the ellipse. Unfortunately,
the ellipse itself has genus 0, and hence is not an elliptic curve.

²Page numbers in references to these papers in this Introduction refer to the original papers.
However, many readers will find it more convenient to consult the English translation of Riemann’s 
works, Riemann (2004). To make this easier to do, I also give section numbers, which are the 
same in the original papers and in the translation. 
But if Riemann’s proofs were not rigorous, his results were so stunning that
they demanded explanation, and this became the task of later mathematicians, 
among them Dedekind and Weber. 

Today, the necessary foundations of topology and analysis have been con- 
structed, so we have the luxury of describing Riemann’s ideas in informal terms 
similar to his own. I think that it is useful to do so, because some of the algebraic 
concepts devised by Dedekind and Weber are scarcely comprehensible if one has 
not seen the topological concepts they replace. In particular, I doubt that readers 
should be confronted with the “ramification ideal” before they have seen a picture 
of “ramification,” or “branching.” Such a picture was given in Neumann (1865), 
the first textbook on Riemann’s theory (Figure 2). 


FIGURE 2. Neumann’s picture of a branch point 


This picture springs to mind when one attempts to visualize the curve $y^2 = x$
for complex variables $x$ and $y$ or, equivalently, the “two-valued function” $y = \pm\sqrt{x}$.
Riemann imagined the two values $+\sqrt{x}$ and $-\sqrt{x}$ lying above $x$ on a two-sheeted
covering of the plane $\mathbb{C}$, as shown in Figure 3. Notice that, as $x$ moves continuously
once around a circle, the corresponding point $\sqrt{x}$ moves continuously around the
lower sheet, then the upper sheet, of the two-sheeted covering, eventually taking the
value $-\sqrt{x}$ that also lies above $x$. Thus the function $\sqrt{x}$ becomes “single-valued”
on the covering surface.

The point $x = 0$ at which the two sheets fuse is called a branch point or
ramification point of the covering, because one used to speak of the “branches of
the multi-valued function”; in this case the two “branches” are $\sqrt{x}$ and $-\sqrt{x}$. The
awkward feature of the picture (that the two sheets appear to pass through each
other) is a result of representing the relation $y^2 = x$ in three dimensions, one fewer
than the four dimensions it really requires. One can visually add a fourth dimension,
“shade of gray,” to the sheets to avoid their meeting in the fourth dimension. This
has actually happened in the Neumann picture, where one sheet is white where
they appear to meet and the other is dark gray.


FIGURE 3. Branch point for the square root 


Just as the curve $y^2 = x$ has a branch point of the two sheets at $x = 0$,
the curve $y^n = x$ has a branch point of $n$ sheets. An arbitrary algebraic curve
$P(x,y) = 0$, where $P$ is a polynomial of degree $n$, is an $n$-sheeted covering of $\mathbb{C}$
with a finite number of branch points. Since behavior of a curve at infinity is
important, Riemann (1857) (Section 1, p. 117) extended $\mathbb{C}$ by a point $\infty$, and the
resulting set $\mathbb{C} \cup \{\infty\}$ can be viewed as a sphere via the stereographic projection
map shown in Figure 4. This idea is made explicit in Neumann (1865), p. 132.
Under stereographic projection, each point $z \in \mathbb{C}$ corresponds to a point $z'$ on the
sphere other than the north pole $N$, and $N$ itself naturally corresponds to $\infty$.


FIGURE 4. Stereographic projection of the sphere to $\mathbb{C} \cup \{\infty\}$


Corresponding to this completion of C to a sphere, we have a completion of each 
algebraic curve to a finite-sheeted covering of the sphere with finitely many branch 
points. Riemann realized that the covering surface S is topologically characterized 
by none other than Abel’s number p, later dubbed the genus (or Geschlecht in Ger- 
man) by Clebsch (1865). Riemann described p topologically as half the number of 
closed cuts needed to make $S$ simply connected (that is, such that any closed curve
can be contracted to a point). In this case the resulting simply connected surface
is a polygon with $4p$ sides. Möbius (1863) gave an even simpler interpretation of $p$,
by showing that each Riemann surface is topologically equivalent to a member of 
the sequence of surfaces shown in Figure 5, namely, the one with p “holes.” 




FIGURE 5. Riemann surfaces of genus 0, 1, 2,... 


As an example, consider the elliptic curve

\[ y^2 = x(x-1)(x+1). \]

This curve is a two-sheeted cover of the sphere, with branch points like that shown
in Figure 3 at $x = 0, 1, -1, \infty$. If we slit the sheets by cuts from $0$ to $\infty$, and from
$1$ to $-1$, then, in order to obtain the branching, the edges of the cuts need to be
identified so that the like-labeled edges shown in Figure 6 come together.


FIGURE 6. How edges are identified at branch points 


But we can make a surface that is topologically the same by separating the two 
sheets before making the identifications, as shown in Figure 7. 

The resulting surface is topologically a torus, shown in more familiar form in 
Figure 8. Thus Abel’s number p = 1 agrees with the topological genus of the torus, 
because the torus has one “hole.” (Notice that if $0$ and $\infty$ are the only branch
points, as is the case with $y^2 = x$, then the result of joining the two sheets is
topologically a sphere, so the genus of $y^2 = x$ is zero.)

Moreover, as promised in the previous section, we can now see the reason for
the two periods of elliptic functions associated with the curve $y^2 = x(x-1)(x+1)$.
The periods are integrals over independent closed paths on the torus surface, such
as the paths $C_1$ and $C_2$ shown in Figure 8.
FIGURE 7. Identifying edges after separating the sheets


FIGURE 8. Independent closed paths on the torus 


4. The Riemann-Hurwitz Formula 


For a simply connected surface, spread over a finite region of the z- 
plane, there is a relationship between the number of simple branch 
points and the number of windings of its boundary curve .... From 
this there results a relation, for a multiply connected surface, between 
these numbers and the number of transverse cuts needed to transform 
it into a simply connected surface. This relation, which does not 
depend on metric considerations and belongs to analysis situs, can be 
derived as follows ... 

Riemann (1857), Section 7, pp. 127-128. 


As we saw in the previous section, Riemann viewed an algebraic curve as a 
surface covering the sphere $\mathbb{C} \cup \{\infty\}$. For a curve of degree $n$ the covering is of $n$
“sheets”: that is, above each point of $\mathbb{C} \cup \{\infty\}$ there are $n$ different points of the
curve, with the exception of finitely many ramification points where two or more
sheets come together. Riemann (1857) showed that the genus can be computed
from the degree $n$ and numbers $e_P$ that give the number of sheets fused together at
a ramification point $P$. Because of a generalization made by Hurwitz (1891), the
method is now called the Riemann-Hurwitz formula. 
The Riemann-Hurwitz formula is most easily explained in terms of the oldest
manifestation of the genus concept, the Euler characteristic, which goes back to
Euler (1752). Euler observed that, for any decomposition of the sphere into $F$
faces, $E$ edges, and $V$ vertices one has $V - E + F = 2$. More generally, the number
$V - E + F$ is invariant for any surface, and it equals $2 - 2p$, where $p$ is the genus.
The invariance of $V - E + F$ follows by observing that any two partitions of a
surface have a common subdivision, obtained by superimposing one on the other.
(This argument is merely plausible, since an edge in one partition may intersect
edges of the other partition infinitely often, but Riemann used it in Riemann (1851),
Section 6, p. 10.) And any subdivision may be obtained by a series of elementary
subdivisions of the following two types:

(1) Subdividing an edge by a new vertex.

(2) Subdividing a face by a new edge connecting two of its vertices.
The first increases both $V$ and $E$ by 1, the second increases both $E$ and $F$ by 1, so
neither changes $V - E + F$.
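The bookkeeping behind this invariance argument can be sketched in a few lines. The example is illustrative only: it tracks just the counts $V$, $E$, $F$ (not the actual geometry), and the cube and the one-vertex decomposition of the torus are assumed sample decompositions, not taken from the text.

```python
from typing import NamedTuple


class Decomposition(NamedTuple):
    V: int  # number of vertices
    E: int  # number of edges
    F: int  # number of faces


def euler_characteristic(d: Decomposition) -> int:
    """V - E + F for a decomposition of a surface."""
    return d.V - d.E + d.F


def subdivide_edge(d: Decomposition) -> Decomposition:
    """Type (1): a new vertex on an edge adds one vertex and one edge."""
    return Decomposition(d.V + 1, d.E + 1, d.F)


def subdivide_face(d: Decomposition) -> Decomposition:
    """Type (2): a new edge across a face adds one edge and one face."""
    return Decomposition(d.V, d.E + 1, d.F + 1)


if __name__ == "__main__":
    cube = Decomposition(V=8, E=12, F=6)    # a decomposition of the sphere
    torus = Decomposition(V=1, E=2, F=1)    # square with opposite sides identified
    refined = subdivide_face(subdivide_edge(subdivide_edge(cube)))
    print(euler_characteristic(cube), euler_characteristic(refined))  # both are 2
    print(euler_characteristic(torus))                                # 0 = 2 - 2*1
```

Each elementary subdivision adds one to a positively counted quantity and one to a negatively counted quantity (or vice versa), which is why the alternating sum cannot change.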

Thus $V - E + F$ is invariant and, by computing it for a standard subdivision
of the genus $p$ surface (such as the one shown for the torus in Figure 8), one finds
that $V - E + F = 2 - 2p$. This quantity is now called the Euler characteristic,
$\chi(S)$, of the surface $S$ of genus $p$. It enables us to compute the genus from a
decomposition of the surface into vertices, edges, and faces, which is particularly
convenient when $S$ is realized as a ramified covering of the sphere. If $S$ were simply
an $n$-sheeted covering of the sphere then we should have $\chi(S) = 2n$, since 2 is
the Euler characteristic of the sphere and $S$ has $n$ vertices over each vertex on the
sphere, $n$ edges over each edge, and $n$ faces over each face.

However, when $n > 1$, the covering has ramification points, so we have to adjust
the count of vertices. We assume that the ramification points are included among
the vertices on the sphere, in which case the count of vertices on $S$ must be reduced
by $e_P - 1$ at each ramification point $P$. On the other hand, the numbers of edges
and faces on $S$ are still $n$ times the corresponding numbers on the sphere, so we
have

\[ \chi(S) = 2n - \sum_P (e_P - 1). \]

Thus the genus $p$ of $S$ depends only on the degree $n$ of the curve and its ramification
numbers $e_P$.

As an example, consider the 2-sheeted covering of the sphere with the four
branch points $P = -1, 0, 1, \infty$ in the previous section. At each of these points
$e_P = 2$, so the Riemann-Hurwitz formula gives $2 - 2p = 2 \cdot 2 - 4 = 0$. Hence $p = 1$,
as obtained previously.

Since $\chi(S) = V - E + F = 2 - 2p$, the above formula for $\chi(S)$ may be written
as the following formula for genus:

\[ p = \frac{1}{2}w - n + 1, \]

where $w = \sum_P (e_P - 1)$ is the sum of the ramification numbers. This formula is
used as the definition of genus in §24 of the Dedekind-Weber paper.
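The genus formula is easy to evaluate mechanically once the number of sheets and the ramification numbers are known. The following sketch uses hypothetical function names; the inputs for $y^5 = x$ and $s^2 = 1 - t^6$ rest on the assumptions stated in the comments about where those curves branch.

```python
from typing import Sequence


def ramification_sum(e_values: Sequence[int]) -> int:
    """w = sum over ramification points P of (e_P - 1)."""
    return sum(e - 1 for e in e_values)


def euler_characteristic(n: int, e_values: Sequence[int]) -> int:
    """chi(S) = 2n - w for an n-sheeted covering of the sphere."""
    return 2 * n - ramification_sum(e_values)


def genus(n: int, e_values: Sequence[int]) -> int:
    """p = w/2 - n + 1, the Riemann-Hurwitz formula solved for the genus."""
    w = ramification_sum(e_values)
    if w % 2 != 0:
        raise ValueError("the ramification sum w must be even")
    return w // 2 - n + 1


if __name__ == "__main__":
    print(genus(2, [2, 2, 2, 2]))  # y^2 = x(x-1)(x+1): branch points -1, 0, 1, infinity
    print(genus(2, [2, 2]))        # y^2 = x: branch points 0 and infinity
    print(genus(5, [5, 5]))        # y^5 = x: assumes all 5 sheets fuse at 0 and at infinity
    print(genus(2, [2] * 6))       # s^2 = 1 - t^6: assumes branching only at the six roots
```

The last case gives genus 2, which matches the reduction of sums of integrals $\int dt/\sqrt{1-t^6}$ to two integrals described in Section 2.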

The determination of genus by degree and ramification numbers is crucial to 
Dedekind and Weber’s program of deriving Riemann’s results by algebraic methods. 
Their algebraic definition of a “Riemann surface” S lacks geometric structure, so 
it is not meaningful to decompose S by vertices, edges, and faces (nor can one 
make “cuts” in it between the ramification points, as in Figure 6). However, it
is possible to show that there are $n$ distinct points of $S$ over each point of the
sphere $\mathbb{C} \cup \{\infty\}$, with the exception of finitely many ramification points $P$, and the
ramification number $e_P$ at each such point can be defined algebraically. Thus they
can define genus in a way that agrees with Riemann’s concept of genus, by means 
of the Riemann-Hurwitz formula. 


5. Functions on Riemann Surfaces 


A system of like-branching algebraic functions and their integrals is 
the main object of our study; but instead of proceeding from expres- 
sions for these functions we define them by their discontinuities with 
the help of Dirichlet’s principle. 

Riemann (1857), Section 1, p. 117. 


It certainly is somewhat daring to infer the existence of U from its 
hydrodynamic significance. 
Weyl (1964), p. 107. 


With the insight gained from his view of algebraic curves as surfaces covering 
the plane $\mathbb{C}$ or the sphere $\mathbb{C} \cup \{\infty\}$, Riemann was able to generalize Cauchy’s
theory of integration on $\mathbb{C}$ to integration on arbitrary algebraic curves. To see
where this may lead, let us first recall some of Cauchy’s main results, and some of 
their immediate consequences. Cauchy developed his theory, in stages of increasing 
generality, between 1814 and 1831. For a very readable and insightful account of 
this period, see Smithies (1997). 

The fundamental result is Cauchy’s theorem, according to which 


[ f(2)dz=0, 


where f is differentiable or holomorphic (and hence not infinite) on and inside the 
closed curve C’ in the complex plane C. It follows that ihe f(z) dz does not depend 
on the path chosen between a and 0, as long as the paths lie in a simply connected 
region of C where f does not become infinite. 
Of course, it is a different story for functions that do become infinite, such as 
f(z) = 1/z at z = 0. We find, for example, that 
\[ \oint_C \frac{dz}{z} = 2\pi i, \]
where C is a counterclockwise path around the unit circle. Nevertheless, the value of the 
integral of 1/z around a closed path C depends only slightly on C: it is the same 
around any path C′ to which C can be deformed without crossing the point z = 0. 
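
The dependence of this integral on the path can be checked numerically. The following is only a sketch: the function name `contour_integral`, the midpoint rule, and the particular circles are illustrative choices, and the circles are traversed counterclockwise unless stated otherwise (the orientation for which the value is $+2\pi i$).

```python
import cmath
import math

def contour_integral(f, center=0j, radius=1.0, steps=2000, counterclockwise=True):
    """Approximate the integral of f around a circle by the midpoint rule."""
    sign = 1.0 if counterclockwise else -1.0
    dtheta = 2.0 * math.pi / steps
    total = 0j
    for k in range(steps):
        theta = sign * (k + 0.5) * dtheta
        point_on_circle = radius * cmath.exp(1j * theta)
        z = center + point_on_circle
        dz = 1j * point_on_circle * sign * dtheta  # derivative of z(theta) times the step
        total += f(z) * dz
    return total

def inverse(z):
    return 1 / z

unit_circle = contour_integral(inverse)                                   # encloses z = 0
deformed = contour_integral(inverse, center=0.3 + 0.2j, radius=2.0)      # still encloses z = 0
not_enclosing = contour_integral(inverse, center=3.0 + 0j, radius=1.0)   # does not enclose z = 0
reversed_path = contour_integral(inverse, counterclockwise=False)        # opposite orientation
```

The first two values agree (both approximate $2\pi i$), the third is approximately zero, and reversing the orientation changes the sign.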
This example is generalized in Cauchy’s integral formula and Cauchy’s residue 
theorem. The integral formula says that 


\[ f(a) = \frac{1}{2\pi i} \oint_C \frac{f(z)}{z-a}\,dz, \]

where C is a circle with center a in a region where f is differentiable, and it has 
the consequence that 
\[ f^{(n)}(a) = \frac{n!}{2\pi i} \oint_C \frac{f(z)}{(z-a)^{n+1}}\,dz, \]
so a differentiable complex function f in fact has derivatives of all orders. It also 
follows that f has a Taylor series expansion in the neighborhood of a point where f’ 
exists, and from this we can conclude that if $f(a_n) = 0$ for a convergent sequence of 
distinct points $a_n$, then f is constantly zero. Another important consequence is the so-called 
Liouville’s theorem: if f is differentiable and bounded on $\mathbb{C}$ then f is constant. (This 
theorem was discovered by Liouville in 1844 but first published by Cauchy (1844); 
see Lützen (1990), p. 124.) 

The residue theorem generalizes the property of the function 1/z to an arbitrary 
meromorphic function f—one that is differentiable except at isolated poles z = a, 
in the neighborhood of which 


\[ f(z) = c_{-h}(z-a)^{-h} + \cdots + c_{-1}(z-a)^{-1} + c_0 + c_1(z-a) + c_2(z-a)^2 + \cdots. \]


The integer h is called the multiplicity or order of the pole z = a, and the coefficient 
$c_{-1}$ is called the residue of f at z = a. For such a function, 

\[ \oint_C f(z)\,dz = 2\pi i\,c_{-1}, \]

where C is a counterclockwise path around z = a, small enough not to touch or enclose 
any other pole of f. Thus the value of the integral depends only on the residue. 
More generally, if C is a path running counterclockwise around poles of f at which the 
residues are $r_1, \ldots, r_m$, then 

\[ \oint_C f(z)\,dz = 2\pi i\,(r_1 + \cdots + r_m). \]


With these theorems we can find all the functions f on $\mathbb{C} \cup \{\infty\}$ that are 
meromorphic. They are precisely the rational functions. A rational function 

\[ f(z) = \frac{p(z)}{q(z)}, \quad \text{where $p$ and $q$ are polynomials}, \]

is differentiable except at its finitely many poles where q(z) = 0, so a rational 
function is certainly meromorphic. 

Conversely, if f(z) is meromorphic on $\mathbb{C} \cup \{\infty\}$ and f is not the constant zero, 
then it can have only finitely many zeros and poles. (If not, there are infinitely 
many zeros or poles in a neighborhood of $\infty$. Infinitely many zeros force f to be 
the constant zero, and infinitely many poles destroy differentiability near infinity.) 
Now, multiplying f(z) by a rational function g(z) that cancels its zeros and poles, 
we obtain a function h(z) = f(z)g(z) that is finite and nonzero everywhere except 
possibly at $z = \infty$. In any case, either h(z) or 1/h(z) is bounded, hence constant 
by Liouville’s theorem. Thus f(z) is a constant multiple of the rational function 
1/g(z), hence rational itself. 

Now a rational function f(z) = p(z)/q(z) is determined, up to a constant 
multiple, by its zeros and poles and their orders. Assuming any common factors of 
p(z) and q(z) have been canceled, the zeros of p(z) give zeros of f(z) and the zeros 
of q(z) give poles of f(z). In addition, if $\deg(p) \neq \deg(q)$, there will be a zero or 
pole at $\infty$ according as $\deg(p) < \deg(q)$ or $\deg(p) > \deg(q)$, making the total order 
of zeros equal to the total order of poles. 

To summarize: a meromorphic function on $\mathbb{C} \cup \{\infty\}$ is determined, up to a 
constant multiple, by a finite set of zeros and poles, with associated orders. A 
function with given zeros and poles actually exists (namely, a rational function) 
provided that the total order of zeros equals the total order of poles. 
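
The bookkeeping in this summary can be illustrated by a short sketch. The representation of zeros and poles as dictionaries (finite point mapped to order) and the function names are assumptions made only for this example; the behavior at infinity is computed from the degrees of numerator and denominator as described above.

```python
from fractions import Fraction

def order_at_infinity(zeros, poles):
    """deg(q) - deg(p): positive means a zero at infinity, negative a pole."""
    deg_p = sum(zeros.values())   # degree of the numerator
    deg_q = sum(poles.values())   # degree of the denominator
    return deg_q - deg_p

def total_orders(zeros, poles):
    """Total order of zeros and of poles on the sphere, infinity included."""
    k = order_at_infinity(zeros, poles)
    total_zeros = sum(zeros.values()) + max(k, 0)
    total_poles = sum(poles.values()) + max(-k, 0)
    return total_zeros, total_poles

def evaluate(zeros, poles, z, constant=1):
    """Value at z of the rational function with the given finite zeros and poles."""
    value = Fraction(constant)
    for a, m in zeros.items():
        value *= (z - a) ** m
    for b, m in poles.items():
        value /= (z - b) ** m
    return value

# f(z) = z^2 (z - 1) / (z + 1): zeros of total order 3, one finite pole,
# and therefore a pole of order 2 at infinity.
example_zeros = {0: 2, 1: 1}
example_poles = {-1: 1}
```

For this example the totals come out equal, as the argument requires, and the only freedom left in the function is the constant multiple.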


Thus Riemann’s program of “defining a function by its discontinuities” is easily 
carried out on $\mathbb{C} \cup \{\infty\}$, a Riemann surface of genus zero. We need only understand 
the “discontinuities” to be the zeros and poles. Implicitly, Cauchy had the result, 
though perhaps not the means of expressing it in terms of functions on a surface. 
However, extending this result to all surfaces of genus zero is already a significant 
problem. It was solved by Riemann (1851) via the Riemann mapping theorem, a 
difficult result that he proved by appealing to what he called the Dirichlet principle. 
The Dirichlet principle is a powerful method for proving the existence of functions 
with given properties (such as specified zeros and poles) on surfaces, but it stems 
from physical intuition about the flow of electricity and was not proved in a form 
suitable for Riemann’s applications until after 1900. 

It should be stressed that this remarkably simple view of meromorphic functions 
on $\mathbb{C} \cup \{\infty\}$ is made possible by the point $\infty$. We would now say that the role of 
$\infty$ is to make the surface compact. Compactness ensures that any infinite set of 
points has a limit point, which makes possible the above argument that a meromorphic 
function on $\mathbb{C} \cup \{\infty\}$ is rational. The meromorphic functions on $\mathbb{C}$ are a much less 
manageable class, since they include functions that are not even algebraic, such 
as $e^z$. The Riemann (1857) view of an algebraic curve as a surface covering the 
sphere led him to an equally simple view of the meromorphic functions on such a 
curve, one that would be the starting point of the Dedekind-Weber theory: they 
are algebraic functions, of degree bounded by the degree of the curve. 

To see why, suppose that f is a meromorphic function on a Riemann surface S, 
so f has only finitely many poles on S. If S is of degree n then there are n points 
$x_1, \ldots, x_n$ (not necessarily distinct) over each $x \in \mathbb{C} \cup \{\infty\}$. Consider the values 
$f(x_1), \ldots, f(x_n)$ of f at these points. Then the “multi-valued function” y = f(x) 
satisfies 

\[ (y - f(x_1)) \cdots (y - f(x_n)) = 0. \]
Expanding the left side, we get 

\[ (*) \qquad y^n + a_{n-1}y^{n-1} + \cdots + a_1 y + a_0 = 0, \]
where 

\[ a_0 = (-1)^n f(x_1) \cdots f(x_n), \]
\[ \vdots \]
\[ a_{n-1} = -(f(x_1) + \cdots + f(x_n)) \]

are the elementary symmetric functions of $f(x_1), \ldots, f(x_n)$. Because of their 
symmetry, $a_0, \ldots, a_{n-1}$ are well-defined meromorphic functions of $x \in \mathbb{C} \cup \{\infty\}$, 
and hence rational functions of x by the result above. So, 
y = f(x) satisfies a polynomial equation (*) of degree n whose coefficients are 
rational functions of x, and this means that f(x) is an algebraic function of x, of 
degree at most n. This is essentially the argument of Riemann (1857), Section 5, 
p. 123. 
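
A small numerical illustration of this argument, under assumptions chosen only for the example: take the surface of the curve $w^2 = x$ (degree n = 2) and the function f = w + x on it. The two points over x are $(x, \pm\sqrt{x})$, and expanding the product should give coefficients $a_1 = -2x$ and $a_0 = x^2 - x$, which are rational in x even though the individual values of f are not.

```python
import cmath

def sheet_values(x):
    """Values of f = w + x at the two points (x, w) of w**2 = x lying over x."""
    w = cmath.sqrt(x)
    return [w + x, -w + x]

def coefficients_from_values(values):
    """Expand prod (y - v) over the values v; returns [a_0, a_1, ..., a_{n-1}, 1]."""
    coeffs = [1 + 0j]                                  # the constant polynomial 1
    for v in values:
        shifted = [0j] + coeffs                        # y * (current polynomial)
        scaled = [-v * c for c in coeffs] + [0j]       # -v * (current polynomial)
        coeffs = [s + t for s, t in zip(shifted, scaled)]
    return coeffs

x = 2 + 1j
a0, a1, a2 = coefficients_from_values(sheet_values(x))
# Compare with the rational functions predicted by hand.
difference_a0 = abs(a0 - (x * x - x))
difference_a1 = abs(a1 - (-2 * x))
```

Both differences are zero up to rounding, so y = f satisfies $y^2 - 2x\,y + (x^2 - x) = 0$ in this example.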

On a surface of genus p > 0 it could likewise be proved that a meromorphic 
function is determined, up to a constant multiple, by its zeros and poles, and that 
the total order of zeros equals the total order of poles. However, it could also be 
proved that there are further constraints on existence, arising from closed paths 
that do not bound a piece of the surface and the corresponding periods of integrals. 
For example, on a surface of genus 1 there is no function with one pole of order 1 
and one zero of order 1. This raises the problem of finding functions, with given 
zeros and poles, that satisfy the known constraints. 

For genus p = 1, constraints were found by Abel (1827), and he was also able 
to prove that any finite set of zeros and poles satisfying these constraints could be 
realized by a meromorphic function, that is, by an elliptic function. He did this 
with the help of “expressions for these functions,” which he and Jacobi had already 
raised to a fine art. “Expressions” for functions on surfaces of higher genus were 
much less developed in Riemann’s time, so he actually made a virtue of a necessity 
by appealing to Dirichlet’s principle in order to prove their existence. The trouble 
was, Weierstrass (1870) showed that the Dirichlet principle fails in certain cases, 
so Riemann’s methods were under suspicion for some decades. 

Riemann (1857) considered functions on a surface of genus p with simple poles 
(that is, poles of order 1) at r given points. These functions form a vector space 
(over the field $\mathbb{C}$) whose dimension $l$ Riemann proved to satisfy 

\[ l \geq r - p + 1 \]

(Riemann’s inequality). Riemann did not use the language of vector spaces and 
dimensions,³ which did not yet exist; he said that the functions have $l$ “arbitrary 
constants.” The inequality generalizes to the case where the poles have multiplicities 
$d_1, \ldots, d_r$, in which case 

\[ l \geq (d_1 + \cdots + d_r) - p + 1. \]


Riemann’s student Roch (1865) turned the inequality into an equality by 
interpreting the difference 

\[ l - [(d_1 + \cdots + d_r) - p + 1] \]

as the dimension of a space of certain functions, today called the canonical class. 
In the special case of meromorphic functions on $\mathbb{C} \cup \{\infty\}$ we can see that 

\[ l = (d_1 + \cdots + d_r) - p + 1, \]

because in this case p = 0 and the meromorphic functions with r poles are of the 
form 

\[ f(z) = \frac{p(z)}{q(z)} = \frac{p(z)}{(z - p_1)^{d_1} \cdots (z - p_r)^{d_r}}, \]

if $\infty$ is not one of the poles. In this case p(z) can be any polynomial of degree 
at most $d_1 + \cdots + d_r$ (the degree of q(z)), and the space of such functions indeed 
has dimension $(d_1 + \cdots + d_r) + 1$, because there are $(d_1 + \cdots + d_r) + 1$ arbitrary 
constants in the definition of a polynomial of degree $(d_1 + \cdots + d_r)$. If $\infty$ is a 
pole, then its order is the difference deg(p) − deg(q), and it is easily checked that 
$l = (d_1 + \cdots + d_r) + 1$ in this case also. 
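
The count of “arbitrary constants” in the genus 0 case can be checked mechanically. The sketch below is an illustration only: it takes an assumed pole set (a double pole at 0 and a simple pole at 1), forms the functions $z^k/q(z)$ for $k = 0, \ldots, \deg q$, and confirms by exact rank computation at sample points that they are linearly independent. It checks the count of independent functions, not the upper bound on the dimension.

```python
from fractions import Fraction

def basis_functions(poles):
    """poles: dict {point: multiplicity}. Returns the functions z -> z**k / q(z)."""
    degree = sum(poles.values())

    def q(z):
        value = Fraction(1)
        for a, d in poles.items():
            value *= (z - a) ** d
        return value

    return [lambda z, k=k: Fraction(z) ** k / q(z) for k in range(degree + 1)]

def rank(rows):
    """Rank of a matrix of Fractions, by Gaussian elimination."""
    rows = [list(r) for r in rows]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(rk, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for i in range(rk + 1, len(rows)):
            factor = rows[i][col] / rows[rk][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[rk])]
        rk += 1
    return rk

def independent_count(poles, sample_points):
    """Number of linearly independent functions among z**k / q(z)."""
    funcs = basis_functions(poles)
    matrix = [[f(z) for f in funcs] for z in sample_points]
    return rank(matrix)

example_poles = {0: 2, 1: 1}                                      # d_1 + d_2 = 3
count = independent_count(example_poles, [2, 3, 4, 5, 6])         # sample points avoid the poles
```

Here `count` equals 4, in agreement with $(d_1 + d_2) - p + 1$ for p = 0.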

We will explain in Section 9 how Dedekind and Weber overhauled these ideas 
so as to avoid assuming the Dirichlet principle, and thereby transformed Riemann- 
Roch into a theorem of algebra. There were attempts to prove the Riemann-Roch 
theorem without appealing to analysis and topology before Dedekind and Weber, 
but these proofs were not completely general, as Dedekind and Weber indicate in the 
first few sentences of their paper. They point out that these previous attempts were 


³ Indeed, the concept is still struggling to emerge in Dedekind and Weber’s paper. They use 
the term Schaar for what we call a vector space, but they reprove the basic vector space properties 
for each new Schaar that comes up. Dedekind formally described the vector space properties of 
a Schaar for the first time on pp. 467—468 of Dedekind (1894). 
made “under certain restrictive assumptions about the singularities of the functions 
under consideration.” This was the case, in particular, for the proof in Brill and 
Noether (1874), where the term “Riemann-Roch theorem” is first employed. For 
more details see Gray (1998). In any case, before discussing Dedekind and Weber, 
we should say a few words on the later development of Riemann’s ideas, and their 
eventual vindication. 


6. Later Development of Analysis on Riemann Surfaces 


Having reached Coutances, we entered an omnibus to go some place 
or other. At the moment when I put my foot on the step the idea 
came to me, without anything in my former thoughts seeming to have 
paved the way for it, that the transformations I had used to define 
the Fuchsian functions were identical with those of non-Euclidean 
geometry. 

From Poincaré’s essay “Mathematical creation” in Poincaré (1918). 


As explained in the previous section, Riemann viewed algebraic curves as surfaces, 
so that “multi-valued functions” such as $\sqrt{z}$ became single-valued and the 
genus p had a simple topological meaning. He also interpreted meromorphic functions 
in terms of flows of electricity, brought about by applying the poles of a 
battery to points of a surface covered by an infinitely thin layer of conducting ma- 
terial. (In fact, this is apparently where the word “pole” comes from.) An account 
of Riemann’s theory in these frankly unrigorous, but intuitively helpful, terms was 
given by Klein (1882). 

Klein’s book (appearing in the same year as the Dedekind and Weber paper) 
made no advance towards a proof of Riemann’s theorems, since the crucial Dirichlet 
principle was still assumed without proof, and the concept of surface remained 
vague. However, it did inspire other mathematicians to rigorize the questionable 
parts of Riemann’s work. Hilbert (1904) proved a “Dirichlet principle” strong 
enough for Riemann’s needs, and Weyl (1913) completed the theory with a precise 
definition of Riemann surfaces. In fact, Weyl’s definition was soon imitated in all 
parts of complex analysis and differential geometry, where it was useful to have a 
concept of a manifold with differentiable structure, and where one wants to decide 
which structures are isomorphic. 

This was not the only direction in which the concept of Riemann surface devel- 
oped. Another was in generalizing the theory of elliptic functions to higher genus. 
To explain what there is to generalize, I will outline the basic facts about elliptic 
functions and their relation to Riemann surfaces. 

Thanks to the work of Abel and Jacobi, elliptic functions were already well 
known in the 1820s as doubly-periodic functions on $\mathbb{C}$. As we have already mentioned 
in Section 3, the Riemann surface concept explains the two periods as integrals 
around certain closed curves on the torus, such as $C_1$ and $C_2$ in Figure 8. 
However, it is also possible to exhibit double periodicity in a formula for an elliptic 
function. The simplest possible formula is one due to Eisenstein (1847): 


\[ f(z) = \sum_{m,n \in \mathbb{Z}} \frac{1}{(z + m\omega_1 + n\omega_2)^2}. \]


Assuming that this series is meaningful (which it is, if the summation is interpreted 
properly), then it clearly remains the same if z is replaced by $z + \omega_1$ or $z + \omega_2$. 
Thus f has two periods, $\omega_1$ and $\omega_2$. Almost as simple, and more standard, is the 
$\wp$-function of Weierstrass (1863): 

\[ \wp(z) = \frac{1}{z^2} + \sum_{(m,n) \neq (0,0)} \left( \frac{1}{(z + m\omega_1 + n\omega_2)^2} - \frac{1}{(m\omega_1 + n\omega_2)^2} \right). \]

The Weierstrass function $\wp(z)$ is not so obviously periodic (one first proves this for 
$\wp'$), but it is easier to work with because its series is uniformly convergent except 
at the double poles where $z = m\omega_1 + n\omega_2$. For example, Weierstrass was able to 
show, by simple series manipulations, that 

\[ \wp'(z)^2 = 4\wp(z)^3 - g_2\wp(z) - g_3, \]

where $g_2$ and $g_3$ are certain constants depending on $\omega_1$ and $\omega_2$. Thus the functions 
$x = \wp(z)$ and $y = \wp'(z)$ parameterize the curve 

\[ y^2 = 4x^3 - g_2 x - g_3, \]
to which any curve of genus 1 happens to be birationally equivalent (for suitable 
choice of $g_2$ and $g_3$). 
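
The differential equation can be tested numerically with truncated lattice sums. This is a rough sketch, not a rigorous computation: the truncation bound `N`, the test points, and the function names are arbitrary choices, and the constants are taken to be the standard lattice sums $g_2 = 60\sum \omega^{-4}$ and $g_3 = 140\sum \omega^{-6}$ over nonzero lattice points $\omega$, which the text above does not spell out.

```python
def lattice_points(w1, w2, N):
    """Nonzero lattice points m*w1 + n*w2 with |m|, |n| <= N."""
    return [m * w1 + n * w2
            for m in range(-N, N + 1)
            for n in range(-N, N + 1)
            if (m, n) != (0, 0)]

def wp(z, lattice):
    """Truncated Weierstrass series."""
    return 1 / z**2 + sum(1 / (z + w)**2 - 1 / w**2 for w in lattice)

def wp_prime(z, lattice):
    """Termwise derivative of the truncated series."""
    return -2 / z**3 + sum(-2 / (z + w)**3 for w in lattice)

def invariants(lattice):
    g2 = 60 * sum(1 / w**4 for w in lattice)
    g3 = 140 * sum(1 / w**6 for w in lattice)
    return g2, g3

def relative_residual(z, w1, w2, N=40):
    """Relative size of wp'^2 - (4 wp^3 - g2 wp - g3) for the truncated sums."""
    lattice = lattice_points(w1, w2, N)
    g2, g3 = invariants(lattice)
    p = wp(z, lattice)
    lhs = wp_prime(z, lattice) ** 2
    rhs = 4 * p**3 - g2 * p - g3
    return abs(lhs - rhs) / abs(lhs)
```

For instance, `relative_residual(0.3 + 0.2j, 1, 1j)` is small, and it shrinks further as `N` grows, which is what one expects if the identity holds exactly for the full series.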

Any function with periods $\omega_1$ and $\omega_2$, such as $\wp$, has values that repeat in each 
parallelogram in the tessellation of the plane $\mathbb{C}$ shown in Figure 9. We assume 
that $\omega_1$ and $\omega_2$ lie in different directions from O, so that their integer combinations 
$m\omega_1 + n\omega_2$ form a lattice of parallelograms. The points marked by stars, which are 
“equivalent modulo the lattice,” form a set on which all values of the function are 
the same. 
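
Lattice equivalence is easy to test in coordinates. The following sketch (the function names and the tolerance are illustrative assumptions) writes the difference of two points as a real combination of the periods and checks whether both coefficients are integers.

```python
def lattice_coordinates(z, w1, w2):
    """Real numbers (s, t) with z = s*w1 + t*w2; assumes w1, w2 are not collinear."""
    det = w1.real * w2.imag - w1.imag * w2.real
    s = (z.real * w2.imag - z.imag * w2.real) / det
    t = (w1.real * z.imag - w1.imag * z.real) / det
    return s, t

def equivalent(z1, z2, w1, w2, tol=1e-9):
    """True if z1 - z2 is an integer combination of w1 and w2 (up to tol)."""
    s, t = lattice_coordinates(z1 - z2, w1, w2)
    return abs(s - round(s)) < tol and abs(t - round(t)) < tol

w1, w2 = 2 + 0j, 1 + 1.5j
z = 0.3 + 0.4j
same_class = equivalent(z, z + 3 * w1 - 2 * w2, w1, w2)
different_class = equivalent(z, z + 0.5 * w1, w1, w2)
```

A function with periods `w1` and `w2` takes the same value at any two points for which `equivalent` returns `True`.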


FIGURE 9. Lattice-equivalent points 


Thus $\wp$ can be viewed as a function on the surface whose “points” are the 
classes of lattice-equivalent points 

\[ z + \omega_1\mathbb{Z} + \omega_2\mathbb{Z} = \{ z + m\omega_1 + n\omega_2 : m, n \in \mathbb{Z} \}. \]

We call the surface the quotient $\mathbb{C}/\Gamma$ of the plane $\mathbb{C}$ by the group $\Gamma$ of translations 
$z \mapsto z + m\omega_1 + n\omega_2$ for $m, n \in \mathbb{Z}$. This Riemann surface, not surprisingly, is a 
torus, obtained by identifying (or “pasting”) opposite sides of the parallelogram 
with vertices $0, \omega_1, \omega_2, \omega_1 + \omega_2$, as shown in Figure 10. Thus, we can also arrive at 
meromorphic functions on the torus by starting with suitably periodic functions in 
the plane $\mathbb{C}$. In the years 1880-1882 (when the Dedekind-Weber paper was awaiting 
FIGURE 10. Construction of a torus by pasting 


publication) a remarkable approach to meromorphic functions on surfaces of genus 
p > 1 was discovered through an exploration of non-Euclidean periodicity, on the 
half-plane {z : Im(z) > 0} or the disk {z : |z| < 1}. 

Isolated examples of functions with striking periodic behavior were discovered 
before 1880, but their periodicity did not have a context or a name. The first picture 
of this new kind of periodicity to appear in print was given by Schwarz (1872), and 
it exhibits the periodicity of a function now known as a Schwarz triangle function. 
The periodicity is indicated in Figure 11 via a tessellation of the disk by curvilinear 
triangles, in each of which the function repeats its values. 
FIGURE 11. The Schwarz tessellation 


The triangles are of course not congruent in the Euclidean sense, but in 1880 
Poincaré realized that they are congruent in the sense of the non-Euclidean geom- 
etry of Bolyai and Lobachevsky. Following this discovery, Poincaré (1882) brought 
non-Euclidean geometry into mainstream mathematics, and unveiled a new view of 
functions on Riemann surfaces—as functions with non-Euclidean periodicity. 

The relationship of the disk to a Riemann surface S of genus p > 1 is like that 
of the plane to the torus: S is a quotient of the disk by a discrete group of (non- 
Euclidean) translations, and the disk is tessellated by congruent copies of a polygon 
obtained by suitably cutting the surface. The simplest case is the surface of genus 
2, which can be cut along the curves shown in Figure 12 to form an octagon whose 
corner angles sum to $2\pi$. 


FIGURE 12. Dissection of genus 2 surface 


The natural home of such an octagon is the non-Euclidean plane, which can be 
modeled by the unit disk with circular arcs orthogonal to the boundary circle as 
“lines.” In particular, we can arrange eight such arcs so as to form an octagon with 
corner angles $\pi/4$, and (non-Euclidean) translations of this octagon fill the whole 
disk as shown in Figure 13. 


FIGURE 13. Tessellation of the disk by octagons 


We can then recover the surface S of genus 2 as the quotient of the disk by 
the group $\Gamma$ of all translations that map the tessellation onto itself. Finally, 
meromorphic functions on S correspond to meromorphic functions f on the disk with 
periodicity given by $\Gamma$, that is, functions with the property that 

\[ f(z) = f(g(z)), \quad \text{for any translation } g \in \Gamma. \]

Thus we are reduced to the problem of constructing meromorphic functions f on 
the disk with periodicity given by $\Gamma$. This problem was solved by Poincaré (1883), 
using insights from the theory of elliptic functions. The solution builds all elements 
$g \in \Gamma$ into f, though not so simply as Eisenstein (1847) built Euclidean translations 
into the definition of a doubly-periodic function. We omit the details. 
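
To give some feeling for the maps involved, here is a sketch of the rigid motions of the disk model in the standard form $z \mapsto (az + b)/(\bar{b}z + \bar{a})$ with $|a|^2 > |b|^2$. This form is not stated in the text above and is assumed here; the code does not single out which motions are translations, nor does it construct the group $\Gamma$. It only checks that such a map sends the disk to itself and preserves the quantity $|z - w|/|1 - \bar{z}w|$, from which the non-Euclidean distance is computed.

```python
def disk_map(a, b):
    """Return g(z) = (a z + b) / (conj(b) z + conj(a)); assumes |a|**2 > |b|**2."""
    def g(z):
        return (a * z + b) / (b.conjugate() * z + a.conjugate())
    return g

def pseudo_distance(z, w):
    """|z - w| / |1 - conj(z) w|, a function of the non-Euclidean distance."""
    return abs(z - w) / abs(1 - z.conjugate() * w)

g = disk_map(2 + 1j, 1 - 1j)          # |a|^2 = 5 > |b|^2 = 2
z, w = 0.3 + 0.4j, -0.5 + 0.1j        # two points of the unit disk
stays_inside = abs(g(z)) < 1 and abs(g(w)) < 1
distortion = abs(pseudo_distance(g(z), g(w)) - pseudo_distance(z, w))
```

The value of `distortion` is zero up to rounding, which is the sense in which the copies of a polygon in such a tessellation are “congruent.”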

Poincaré’s construction of functions with non-Euclidean periodicity (Fuchsian 
functions, or automorphic functions as they became known) was a spectacular 
extension of the theory of elliptic functions to surfaces of higher genus. At the same 
time, it showed that elliptic functions and the torus are special, being associated 
with Euclidean geometry, and that the general Riemann surface should be viewed as 
non-Euclidean. However, like Riemann, Poincaré assumed an unproved theorem— 
the so-called uniformization theorem—to give his results their most general form. 

Just as Riemann needed an assumption (the Dirichlet principle) to prove that 
any surface of genus 0 is isomorphic to the sphere $\mathbb{C} \cup \{\infty\}$, Poincaré needed to 
assume that any Riemann surface of genus p > 1 is isomorphic to a quotient of the 
disk by a discrete group of non-Euclidean translations. The uniformization theorem 
encompasses both these theorems by stating that any simply connected Riemann 
surface is isomorphic to either the sphere $\mathbb{C} \cup \{\infty\}$, the Euclidean plane $\mathbb{C}$, or the 
unit disk {z : |z| < 1}. Uniformization also extends the known parameterization 
theorems for algebraic curves of genus 0 (parameterization by rational functions) 
and genus 1 (parameterization by elliptic functions), because it implies that any 
algebraic curve of genus > 1 may be parameterized by automorphic functions. 

Poincaré’s work was clearly a brilliant extension of Riemann’s ideas, but it 
remained under a cloud for some time because of doubts about its rigor. Indeed 
his papers of 1882 and 1883 were not published by the leading journals of the time; 
instead, they played a big part in establishing the new journal Acta Mathematica.⁴ 
The doubts were eventually dispelled with proofs of the uniformization theorem by 
Poincaré (1907) and Koebe (1907), and by the rigorous theory of Riemann surfaces 
developed by Weyl (1913). Thus, it took more than 50 years for the analytic theory 
of algebraic curves to be fully accepted, and perhaps this worked to the advantage of 
algebra and algebraic geometry. The algebraic restructuring of Riemann’s concepts 
by Dedekind and Weber may not have taken place had an analytic alternative been 
available, and we might thereby have missed a radically new insight into the nature 
of algebraic curves. The first textbook development of the algebraic theory was 
Hensel and Landsberg (1902), and then only with a reversion to certain ideas from 
analysis, such as infinite series expansions. For an English version of the Hensel- 
Landsberg approach, with some simplifications, see Bliss (1933). 

Even today, it is a little strange to think that number theory inspired the first 
complete and rigorous proof of the Riemann-Roch theorem. Nevertheless, if one 
looks at the proof of the theorem in any modern book one will see that it involves 
things called “divisors.” A divisor is nothing but a finite set of points with attached 
integer multiplicities—like a set of zeros and poles—yet it sounds as if the concept 
comes from number theory. In the next sections we explain how this happened. 


⁴ Looking back to his founding of Acta Mathematica, Mittag-Leffler (1923) recalled: 


Kronecker, for example, expressed to me via a mutual friend his regret that 
the journal seemed bound to fail, without help, through publishing a work 
so incomplete, unripe, and obscure. 
7. Origins of Algebraic Number Theory 


It is greatly to be lamented that this virtue of the real numbers [i.e., 
the rational integers] to be decomposable into prime factors, always 
the same ones for a given number, does not also belong to the complex 
numbers; were this the case, the whole theory, which is still laboring 
under such difficulties, could easily be brought to its conclusion. For 
this reason, the complex numbers we have been considering seem im- 
perfect, and one may well ask whether one ought to look for another 
kind which would preserve the analogy with the real numbers ... 
Translation by Weil (1975) from Kummer (1844). 


The concept of divisibility has been fundamental in number theory for more 
than 2000 years. Euclid’s Elements, Book VII, introduces the Euclidean algorithm 
in Proposition 1, and shows that it yields the greatest common divisor of two 
numbers in Proposition 2. Eventually Euclid deduces, in Proposition 30, that if a 
prime p divides a product ab then p divides a or p divides b. This prime divisor 
property easily yields what we now call the fundamental theorem of arithmetic: 
each natural number has unique prime factorization (up to the order of factors). 

Unique prime factorization was first explicitly stated as a theorem in Gauss 
(1801), but before then it was frequently assumed without comment. Indeed, unique 
prime factorization was sometimes assumed for numbers seemingly far from the 
natural numbers. One of the first instances was the spectacular proof of Euler 
(1770), p. 401, that the only natural number solution of 


\[ y^3 = x^2 + 2 \]

is x = 5, y = 3, a result that had been claimed by Fermat. To prove this result, 
Euler factorized the right side of the equation, obtaining 

\[ y^3 = (x + \sqrt{-2})(x - \sqrt{-2}), \]

and then worked with numbers of the form $a + b\sqrt{-2}$ (for integers a and b) as if 
they were ordinary integers. In particular, he argued that $x + \sqrt{-2}$ and $x - \sqrt{-2}$ 
are relatively prime and therefore (apparently assuming unique prime factorization) 
that each is a cube, because their product is a cube. But if 

\[ x + \sqrt{-2} = (a + b\sqrt{-2})^3 \]
it follows that 
\[ x = a^3 - 6ab^2 = a(a^2 - 6b^2), \qquad 1 = 3a^2 b - 2b^3 = b(3a^2 - 2b^2). \]

The only integer solutions of the latter equation are $a = \pm 1$, $b = 1$, of which only 
a = −1, b = 1 give a natural number solution, namely x = 5. 
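
The last step of Euler’s argument, and the final claim itself, can be checked by brute force over a finite range. This is only a sanity check, not a proof; the search bounds and function names are arbitrary choices for the sketch.

```python
import math

def cube_coefficients(a, b):
    """Return (x, c) with (a + b*sqrt(-2))**3 = x + c*sqrt(-2)."""
    x = a**3 - 6 * a * b**2
    c = 3 * a**2 * b - 2 * b**3
    return x, c

def euler_candidates(bound=50):
    """All (a, b, x) with |a|, |b| <= bound and coefficient of sqrt(-2) equal to 1."""
    return sorted((a, b, cube_coefficients(a, b)[0])
                  for a in range(-bound, bound + 1)
                  for b in range(-bound, bound + 1)
                  if cube_coefficients(a, b)[1] == 1)

def brute_force(limit=2000):
    """Natural number solutions (x, y) of y**3 = x**2 + 2 with y < limit."""
    solutions = []
    for y in range(1, limit):
        x_squared = y**3 - 2
        if x_squared < 1:
            continue
        x = math.isqrt(x_squared)
        if x * x == x_squared:
            solutions.append((x, y))
    return solutions
```

Within these bounds, `euler_candidates()` finds only $(a, b) = (\pm 1, 1)$, giving $x = \pm 5$ up to sign, and `brute_force()` finds only $(x, y) = (5, 3)$.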

This is magnificent, but is it number theory? 

Evidently, we need to rebuild the theory of divisibility and primes for new types 
of numbers, such as those of the form $a + b\sqrt{-2}$. The first to take steps in this 
direction was Gauss (1832), who worked out the basic theory of what we now call 
the Gaussian integers, $\mathbb{Z}[i]$. These are the numbers of the form a + bi, where a and 
b are integers and $i = \sqrt{-1}$. 

We now sketch the divisibility theory of $\mathbb{Z}[i]$, since it admits a visual interpretation 
that will be helpful in other cases. The only result we really need to prove is 
the division property (sometimes called the “division algorithm”): if $\alpha$ and $\beta \neq 0$ 
are Gaussian integers, then there are Gaussian integers $\mu$ and $\rho$ such that 
\[ \alpha = \mu\beta + \rho, \quad \text{where } |\rho| < |\beta|. \]

If we can prove this, then there is a Euclidean algorithm, and unique prime factorization 
in $\mathbb{Z}[i]$ follows as it does for ordinary integers. 

Well, for any Gaussian integer $\beta \neq 0$, consider the multiples $\mu\beta$ of $\beta$ by all 
the Gaussian integers $\mu$. These are the integer combinations of the perpendicular 
vectors $\beta$ and $i\beta$, which form squares of side length $|\beta|$, as shown in Figure 14. 


FIGURE 14. Multiples of $\beta$ in $\mathbb{Z}[i]$ 


The Gaussian integer $\alpha$ falls in one of these squares, so if $\mu\beta$ is the nearest 
corner we have 

\[ |\rho| < |\beta|, \quad \text{where } \rho = \alpha - \mu\beta, \]

because the distance of a point in a square from the nearest corner is less than 
the side length. This establishes the division property, and hence unique prime 
factorization in $\mathbb{Z}[i]$. 
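
The “nearest corner” argument translates directly into a division routine. In this sketch a Gaussian integer a + bi is represented as a pair of integers `(a, b)`; that representation and the function names are assumptions of the example. The quotient is obtained by rounding the real and imaginary parts of $\alpha/\beta$ to the nearest integers, and the Euclidean algorithm then follows as for ordinary integers.

```python
def norm(z):
    a, b = z
    return a * a + b * b

def mul(z, w):
    (a, b), (c, d) = z, w
    return (a * c - b * d, a * d + b * c)

def round_div(p, q):
    """Nearest integer to p / q for q > 0 (ties rounded up), in exact arithmetic."""
    return (2 * p + q) // (2 * q)

def divide(alpha, beta):
    """Return (mu, rho) with alpha = mu*beta + rho and norm(rho) < norm(beta)."""
    a, b = alpha
    c, d = beta
    n = norm(beta)
    # alpha / beta = alpha * conj(beta) / n; round each coordinate separately.
    mu = (round_div(a * c + b * d, n), round_div(b * c - a * d, n))
    product = mul(mu, beta)
    rho = (a - product[0], b - product[1])
    return mu, rho

def gcd(alpha, beta):
    """Euclidean algorithm in Z[i]; the result is determined up to a unit factor."""
    while beta != (0, 0):
        alpha, beta = beta, divide(alpha, beta)[1]
    return alpha
```

For example, `gcd((797, 0), (26, 11))` returns `(26, 11)`, reflecting the fact that 26 + 11i divides 797.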

The argument is similar for the set $\mathbb{Z}[\sqrt{-2}]$ of numbers of the form $a + b\sqrt{-2}$ 
considered by Euler. The multiples $\mu\beta$ of such a number $\beta$ form a grid of rectangles, 
with sides of length $|\beta|$ and $\sqrt{2}\,|\beta|$, but it remains true that the distance of a 
point $\alpha$ from the nearest corner is less than $|\beta|$. Thus the division property holds 
for $\mathbb{Z}[\sqrt{-2}]$, hence a Euclidean algorithm and unique prime factorization, so the 
main assumption of Euler’s proof is valid. (The other assumption, about relative 
primality, is also easy to justify.) 

Many results about ordinary integers are most easily proved by passing to 
“quadratic integers,” such as $\mathbb{Z}[i]$, and appealing to unique prime factorization. For 
example, consider the prime 797, which is the sum of two squares $26^2 + 11^2$. Is this 
sum of squares unique? Yes, because if $797 = a^2 + b^2$ we have $797 = (a + bi)(a - bi)$, 
which is a prime factorization because $|a + bi|^2 = |a - bi|^2$ is the ordinary prime 
$a^2 + b^2 = 797$ (so neither a + bi nor a − bi is a product of smaller Gaussian integers). 
Then, since Gaussian prime factorization is unique, so is the decomposition of 797 
into a sum of squares. 
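
The uniqueness claim for 797 is easy to confirm by direct search. The sketch below lists representations $n = a^2 + b^2$ with $0 \le a \le b$; the function name is an illustrative choice, and the composite number 65 is included only for contrast.

```python
import math

def two_square_representations(n):
    """All pairs (a, b) with 0 <= a <= b and a*a + b*b == n."""
    representations = []
    for a in range(math.isqrt(n) + 1):
        b_squared = n - a * a
        b = math.isqrt(b_squared)
        if b * b == b_squared and a <= b:
            representations.append((a, b))
    return representations

prime_case = two_square_representations(797)
composite_case = two_square_representations(65)
```

The prime 797 has the single representation (11, 26), whereas 65 = 5 · 13 has two, (1, 8) and (4, 7).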
Unfortunately, there is trouble not far ahead. If we wish to prove theorems 
about natural numbers of the form $a^2 + 5b^2$ by using the factorization 

\[ a^2 + 5b^2 = (a + b\sqrt{-5})(a - b\sqrt{-5}) \]

we cannot assume unique prime factorization, because it fails for the set $\mathbb{Z}[\sqrt{-5}]$ 
of numbers of the form $a + b\sqrt{-5}$. The classical example, from Dedekind (1877), is 

\[ 6 = 2 \cdot 3 = (1 + \sqrt{-5})(1 - \sqrt{-5}). \]

Each of the numbers 2, 3, $1 + \sqrt{-5}$, $1 - \sqrt{-5}$ is “prime,” in the sense that none 
of them is a product of smaller numbers in $\mathbb{Z}[\sqrt{-5}]$, so we have nonunique prime 
factorization in $\mathbb{Z}[\sqrt{-5}]$. Gauss seemed to be aware that unique prime factorization 
could fail in cases like this; his theory of quadratic forms in Gauss (1801) seems 
designed to avoid such problems, sometimes at considerable expense. 
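
That none of these four numbers is a product of smaller ones can be seen from norms. The sketch below uses the norm $N(a + b\sqrt{-5}) = a^2 + 5b^2$, which is multiplicative; this standard fact is assumed rather than taken from the text. A number splits into two non-unit factors only if some element has norm equal to a proper divisor of its norm. Elements are represented as pairs `(a, b)`, an assumption of the example.

```python
import math

def norm(z):
    a, b = z
    return a * a + 5 * b * b

def mul(z, w):
    (a, b), (c, d) = z, w
    return (a * c - 5 * b * d, a * d + b * c)

def elements_of_norm(n):
    """All (a, b) with a*a + 5*b*b == n."""
    bound = math.isqrt(n)
    return [(a, b)
            for a in range(-bound, bound + 1)
            for b in range(-bound, bound + 1)
            if norm((a, b)) == n]

def no_proper_factor_by_norm(z):
    """True if no element has norm equal to a proper divisor d (1 < d < N(z)).

    This is a sufficient condition for z to have no factorization into
    two non-units; it is not a general irreducibility test.
    """
    n = norm(z)
    proper_divisors = [d for d in range(2, n) if n % d == 0]
    return all(not elements_of_norm(d) for d in proper_divisors)

factors = [(2, 0), (3, 0), (1, 1), (1, -1)]
checks = [no_proper_factor_by_norm(z) for z in factors]
```

All four checks succeed because no element has norm 2 or 3, while both products `mul((2, 0), (3, 0))` and `mul((1, 1), (1, -1))` equal `(6, 0)`.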

However the first to note the phenomenon in print, and to declare that some- 
thing should be done about it, was Kummer (1844) (see the quote at the beginning 
of this section). In a bold attempt to rescue unique prime factorization, Kummer 
introduced what he called “ideal numbers.” The name was apparently motivated 
by the “ideal” elements that were then becoming accepted in geometry, such as 
points at infinity, but the concept drew inspiration from Jacobi’s work on number 
theory in the 1830s, as has been shown by Lemmermeyer (2009). In Kummer’s 
work, “ideal numbers” arise in a rather complicated way, and we will instead follow 
Dedekind (1877) and explain how they work in $\mathbb{Z}[\sqrt{-5}]$. 

To reconcile the factorizations 

\[ 6 = 2 \cdot 3 = (1 + \sqrt{-5})(1 - \sqrt{-5}) \]

with unique prime factorization, we have to believe that 2, 3, $1 + \sqrt{-5}$, $1 - \sqrt{-5}$ 
are not primes, after all. They must somehow split into smaller, “ideal,” factors, 
but what are these ideal factors? A candidate that comes to mind is the greatest 
common divisor of 2 and $1 + \sqrt{-5}$. We cannot put our finger on any such number 
but, by borrowing an idea from classical number theory, we can describe the set of 
multiples of a greatest common divisor. In classical number theory, the multiples 
of gcd(a, b) are all the numbers of the form ma + nb. In $\mathbb{Z}[\sqrt{-5}]$, there is no actual 
number that can serve as $\gcd(2, 1 + \sqrt{-5})$, but the set 

\[ \{ 2\mu + (1 + \sqrt{-5})\nu : \mu, \nu \in \mathbb{Z}[\sqrt{-5}] \} = \{ 2m + (1 + \sqrt{-5})n : m, n \in \mathbb{Z} \} \]

is perfectly concrete, and it can serve as the set of “multiples of the ideal number 
$\gcd(2, 1 + \sqrt{-5})$.” 
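
The equality of the two sets can be tested on a finite range. In the sketch below an element $a + b\sqrt{-5}$ is a pair `(a, b)`; since $2m + (1 + \sqrt{-5})n = (2m + n) + n\sqrt{-5}$, membership in the right-hand set amounts to a − b being even. The representation and function names are assumptions of the example.

```python
def mul(z, w):
    """Product in Z[sqrt(-5)] of a + b*sqrt(-5) and c + d*sqrt(-5)."""
    (a, b), (c, d) = z, w
    return (a * c - 5 * b * d, a * d + b * c)

def in_ideal(z):
    """True if z = 2m + (1 + sqrt(-5))n for some ordinary integers m, n."""
    a, b = z
    return (a - b) % 2 == 0

def as_integer_combination(z):
    """Return (m, n) with z = 2m + (1 + sqrt(-5))n; z must lie in the set."""
    a, b = z
    if not in_ideal(z):
        raise ValueError("element is not in the set")
    return ((a - b) // 2, b)

def closed_under_ring_multiples(bound=4):
    """Check that ring multiples of 2 and 1 + sqrt(-5) stay in the integer-combination set."""
    generators = [(2, 0), (1, 1)]
    return all(in_ideal(mul((c, d), g))
               for c in range(-bound, bound + 1)
               for d in range(-bound, bound + 1)
               for g in generators)
```

The check succeeds, while `in_ideal((1, 0))` is false: the set is not the whole ring, so it behaves like the multiples of a genuine common divisor of 2 and $1 + \sqrt{-5}$ other than 1.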

Figure 15 shows what this set looks like. Its members are the black dots 
on the grid of points in $\mathbb{Z}[\sqrt{-5}]$. Notice that these black dots do not form a 
grid of rectangles, as they would if they were multiples of an actual number in 
$\mathbb{Z}[\sqrt{-5}]$, because the multiples of a number $\alpha$ are simply the rectangular grid 
$\mathbb{Z}[\sqrt{-5}]$, magnified by $|\alpha|$ and rotated through the argument of $\alpha$. 

The number $\gcd(2, 1 + \sqrt{-5})$ is “ideal,” but the set of its multiples is perfectly 
real, and visible. When we admit “ideal numbers,” via their sets of multiples, 
it turns out that there are “ideal primes,” and that each number in $\mathbb{Z}[\sqrt{-5}]$ 
factorizes uniquely into ideal primes. This is how Kummer recovered unique prime 
factorization, and hence preserved the analogy with the ordinary integers. 

FIGURE 15. Multiples of the ideal number $\gcd(2, 1 + \sqrt{-5})$ in $\mathbb{Z}[\sqrt{-5}]$ 


8. Dedekind’s Theory of Algebraic Integers 


The great success of Kummer’s researches in the domain of circle di- 
vision allows us to suppose that the same laws hold in all numerical 
domains .... I did not achieve the general theory ... until I abandoned 
the old formal approach and replaced it by another; a fundamentally 
simpler conception focussed directly on the goal. In the latter ap- 
proach I need no concept more novel than that of Kummer’s ideal 
numbers, and it is sufficient to consider a system of actual numbers 
that I call an ideal. 

Dedekind (1877), pp. 56-57. 


Kummer was interested in a particular class of numbers, now known as cyclo- 
tomic integers, most famously for their application to Fermat’s last theorem. He 
developed his theory of “ideal numbers” only for the cyclotomic integers, and it was 
left to Dedekind to develop a general theory of algebraic integers, and what we now 
know as the theory of ideals. Dedekind’s first account of this theory appeared in 
Dedekind (1871), an appendix to Dirichlet’s Vorlesungen über Zahlentheorie, which 
Dedekind edited. This did not attract as much interest as Dedekind hoped, and 
he produced a more down-to-earth exposition of the theory in Dedekind (1877). I 
have drawn on the latter version here. 

From a handful of examples such as $\mathbb{Z}[i]$, $\mathbb{Z}[\sqrt{-2}]$, and Kummer’s cyclotomic 
integers, Dedekind (1871) synthesized a general concept of algebraic integer. Then, 
by properly situating algebraic integers in the broader context of algebraic numbers, 
he was able to define ideal prime numbers and prove the uniqueness of ideal prime 
factorization. In outline, his train of thought was the following. 
Begin with the algebraic numbers. A number $\alpha$ is algebraic if it is the root of
an equation


$$a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 = 0, \quad\text{where } a_0, a_1, \ldots, a_n \in \mathbb{Z}.$$


Among the algebraic numbers, those satisfying equations of this form in which
$a_n = 1$ (monic equations) are called algebraic integers. It can be proved (not quite
trivially) that the algebraic numbers are closed under the operations $+$, $-$, $\times$, $\div$ (by
a nonzero number), and hence they form a field, and that the algebraic integers
are closed under the operations $+$, $-$, $\times$, and hence they form a ring. The concept
of algebraic integer is compatible with the ordinary concept of integer because any
rational solution of a monic equation is an ordinary integer. This fact was already
mentioned by Gauss (1801), article 11.

For these reasons, and others that will appear below, the concept of algebraic 
integer is a good generalization of the ordinary integer concept. This is not com- 
pletely obvious, because there are algebraic integers that do not “look integral.” 
One might think that the cube root of 1,

$$\zeta_3 = \frac{-1+\sqrt{-3}}{2},$$

should not be regarded as an integer because of its “fractional” appearance. Yet it is
a root of the monic equation $x^2 + x + 1 = 0$. And in fact it is better to work in
the ring

$$\mathbb{Z}[\zeta_3] = \{a + b\zeta_3 : a, b \in \mathbb{Z}\}$$

than in the ring

$$\mathbb{Z}[\sqrt{-3}] = \{a + b\sqrt{-3} : a, b \in \mathbb{Z}\},$$

because $\mathbb{Z}[\zeta_3]$ has unique prime factorization and $\mathbb{Z}[\sqrt{-3}]$ does not.

However, it is not a good idea to work with the ring of all algebraic integers,
because it cannot possibly have unique prime factorization. This is because any
algebraic integer $\alpha$ has a factorization


$$\alpha = \sqrt{\alpha}\,\sqrt{\alpha},$$


and $\sqrt{\alpha}$ is also an algebraic integer. Dedekind (1871) saw that the way to avoid
this problem is to work in an algebraic number field $F$ of finite degree over $\mathbb{Q}$, and
with the ring of algebraic integers in $F$. Any such field has the form⁵


$$\mathbb{Q}(\alpha) = \{p(\alpha)/q(\alpha) : p, q \text{ polynomials with integer coefficients}\},$$


for some algebraic number $\alpha$. The degree of $\alpha$ (that is, the degree of the minimal
polynomial satisfied by $\alpha$) is called the degree of $F = \mathbb{Q}(\alpha)$, $\deg(F)$.
Each algebraic integer $\beta$ in $F$ has a minimal polynomial


$$x^n + b_{n-1} x^{n-1} + \cdots + b_1 x + b_0,$$

where $n \le \deg(F)$, and $b_0, \ldots, b_{n-1}$ are ordinary integers. The ordinary integer
$(-1)^n b_0$ is called the norm of $\beta$, $N(\beta)$. $N(\beta)$ is the product of $\beta$ with all the
other roots of the minimal polynomial, called the conjugates of $\beta$. For example,


⁵ Dedekind actually defines a field $F$ of finite degree $n$ over $\mathbb{Q}$ more generally, as a field of
dimension $n$ as a vector space over $\mathbb{Q}$. But it is true that there is always a primitive element $\alpha$
such that $F = \mathbb{Q}(\alpha)$. This result, essentially due to Galois, also applies to the function fields that
appear later, so we will also suppose each of them to be generated by a primitive element.
a Gaussian integer $\beta = a + bi$ has one conjugate, the ordinary complex conjugate
$\bar\beta = a - bi$, and

$$N(\beta) = (a + bi)(a - bi) = a^2 + b^2 = |\beta|^2.$$
The familiar property of absolute value, $|\alpha\beta| = |\alpha||\beta|$, generalizes to the multiplicative
property of norm:

$$N(\alpha\beta) = N(\alpha)N(\beta).$$
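
The multiplicative property can be spot-checked numerically in quadratic rings. The following Python sketch is an illustration added here and is not taken from Dedekind: it stores $a + b\sqrt{d}$ as the integer pair `(a, b)` and, as an assumption, takes the norm to be the product with the conjugate $a - b\sqrt{d}$, as in the Gaussian example (so an element with $b = 0$ gets norm $a^2$). The function names are hypothetical.

```python
from itertools import product


def multiply(alpha, beta, d):
    """Multiply a + b*sqrt(d) by c + e*sqrt(d); elements are integer pairs."""
    a, b = alpha
    c, e = beta
    return (a * c + d * b * e, a * e + b * c)


def norm(alpha, d):
    """Product of a + b*sqrt(d) with its conjugate a - b*sqrt(d)."""
    a, b = alpha
    return a * a - d * b * b


def norm_is_multiplicative(d, bound=3):
    """Check N(alpha*beta) == N(alpha)*N(beta) for all pairs in a small box."""
    elements = list(product(range(-bound, bound + 1), repeat=2))
    return all(
        norm(multiply(x, y, d), d) == norm(x, d) * norm(y, d)
        for x in elements
        for y in elements
    )


print(norm_is_multiplicative(-1))  # Gaussian integers Z[i]
print(norm_is_multiplicative(-5))  # Z[sqrt(-5)]
```

A finite check of this kind proves nothing in general, of course; it only makes the identity tangible for $d = -1$ and $d = -5$.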

It follows from the multiplicative property that, if $\gamma = \alpha\beta$, then we have $N(\gamma) =
N(\alpha)N(\beta)$. This reduces certain questions about divisibility of algebraic integers to
questions about divisibility of ordinary integers. For example, if $\alpha$ divides $\gamma$, then
$N(\alpha)$ divides $N(\gamma)$. It follows that the process of factorizing an algebraic integer $\gamma$
must eventually terminate with a factorization


$$\gamma = \alpha_1 \alpha_2 \cdots \alpha_k,$$


where $N(\alpha_1), N(\alpha_2), \ldots, N(\alpha_k)$ are ordinary primes.
Thus any integer $\gamma$ of $F$ has a factorization into integers $\alpha_1, \alpha_2, \ldots, \alpha_k$ that
are irreducible in the sense that $\alpha_i$ is not the product of integers of smaller norm.
We know from the example


$$6 = 2 \cdot 3 = (1 + \sqrt{-5})(1 - \sqrt{-5})$$


in $\mathbb{Z}[\sqrt{-5}]$, which is the ring of integers of $\mathbb{Q}(\sqrt{-5})$, that factorization into
irreducibles is not always unique. But we also know that unique prime factorization
can be recovered, in this case, by further splitting into “ideal numbers.” Dedekind
generalized this recovery program with his concept of ideals. An ideal $\mathfrak{a}$ is a set of
integers of a field $F$ with the following closure properties:

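1. If $\alpha \in \mathfrak{a}$ and $\alpha' \in \mathfrak{a}$ then $\alpha + \alpha' \in \mathfrak{a}$ and $\alpha - \alpha' \in \mathfrak{a}$.

2. If $\alpha \in \mathfrak{a}$ and $\mu$ is any integer of $F$, then $\mu\alpha \in \mathfrak{a}$.

These properties capture the idea of a “set of integer multiples” of an integer, actual
or ideal.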

If $\alpha$ is an integer of $F$ then its set of multiples


$$(\alpha) = \{\mu\alpha : \mu \text{ an integer of } F\}$$

certainly has the properties of an ideal. We call $(\alpha)$ the principal ideal generated
by $\alpha$. The set in $\mathbb{Z}[\sqrt{-5}]$,

$$\{2\mu + (1 + \sqrt{-5})\nu : \mu, \nu \in \mathbb{Z}[\sqrt{-5}]\} = \{2m + (1 + \sqrt{-5})n : m, n \in \mathbb{Z}\},$$


also has the required closure properties, so it is also an ideal. But, as we have seen,
this ideal is not the set of integer multiples of an actual integer in $\mathbb{Q}(\sqrt{-5})$; it is a
nonprincipal ideal.
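
A small computation makes this example concrete. The Python sketch below (with hypothetical helper names) uses the observation that $2m + (1+\sqrt{-5})n = (2m+n) + n\sqrt{-5}$, so that $a + b\sqrt{-5}$ lies in the set exactly when $a - b$ is even. It checks the two closure properties on a finite box of elements, and then illustrates one standard norm argument for nonprincipality, which is not necessarily the argument used earlier in this introduction: a generator would divide both 2 and $1+\sqrt{-5}$, so its norm $a^2 + 5b^2$ would divide both 4 and 6, and hence would be 1 or 2.

```python
from itertools import product

D = -5  # elements a + b*sqrt(-5) of Z[sqrt(-5)] are stored as pairs (a, b)


def multiply(alpha, beta):
    a, b = alpha
    c, e = beta
    return (a * c + D * b * e, a * e + b * c)


def norm(alpha):
    a, b = alpha
    return a * a - D * b * b  # a^2 + 5*b^2


def in_ideal(alpha):
    """Membership in {2m + (1 + sqrt(-5))n : m, n integers}: a - b is even."""
    a, b = alpha
    return (a - b) % 2 == 0


box = list(product(range(-4, 5), repeat=2))
members = [x for x in box if in_ideal(x)]

# Closure property 1: sums and differences of members are members.
closed_add = all(
    in_ideal((x[0] + y[0], x[1] + y[1])) and in_ideal((x[0] - y[0], x[1] - y[1]))
    for x in members
    for y in members
)

# Closure property 2: a member times any integer of the ring is a member.
closed_mul = all(in_ideal(multiply(mu, x)) for mu in box for x in members)

# Candidate generators: norm 2 is impossible, and norm 1 gives only the
# units 1 and -1, which are not members of the set.
norm_two = [x for x in box if norm(x) == 2]
units_in_ideal = [x for x in members if norm(x) == 1]

print(closed_add, closed_mul, norm_two, units_in_ideal)
```

The box is large enough to contain every solution of $a^2 + 5b^2 \le 2$, so the two empty lists are conclusive for those norms, while the closure checks are only samples.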

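If $\mathfrak{a}$ and $\mathfrak{b}$ are ideals, Dedekind (1871) defines their product by


$$\mathfrak{a}\mathfrak{b} = \{\alpha_1\beta_1 + \cdots + \alpha_m\beta_m : \alpha_1, \ldots, \alpha_m \in \mathfrak{a} \text{ and } \beta_1, \ldots, \beta_m \in \mathfrak{b}\}.$$


This definition agrees with the natural product $(\alpha)(\beta) = (\alpha\beta)$ when $\mathfrak{a} = (\alpha)$ and
$\mathfrak{b} = (\beta)$ are principal ideals. Finally, Dedekind (1871) says that $\mathfrak{b}$ divides $\mathfrak{a}$ if $\mathfrak{b} \supseteq \mathfrak{a}$
(“to divide is to contain”). This also agrees with the natural concept of division for
principal ideals. For example, 2 divides 6, so naturally $(2)$ divides $(6)$, and indeed
$(2) \supseteq (6)$ because


$$(2) = \{0, \pm 2, \pm 4, \pm 6, \ldots\} \supseteq \{0, \pm 6, \pm 12, \ldots\} = (6).$$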
Up to this point, the theory is a straightforward extension of the usual divisibility
concept to ideals, if one views ideals as sets of multiples. But, as Dedekind
(1877), §23, points out:


It is very easy to see (§22,1) that any product of $\mathfrak{a}$ by an ideal $\mathfrak{b}$ is
divisible by $\mathfrak{a}$, but it is by no means easy to show the converse, that
each ideal divisible by $\mathfrak{a}$ is the product of $\mathfrak{a}$ by an ideal $\mathfrak{b}$.


Eventually, Dedekind overcomes the difficulty by generalizing the theory of divisibility
to more general subsystems of integers of $F$ called orders, and unique prime
ideal factorization follows.

We will not go into further detail, because the details are somewhat different 
in the case of algebraic function fields, as Dedekind and Weber point out. We only 
wish to emphasize how important it was to be aware of the notion of ideal. The 
theory of algebraic functions would not have been discovered without this awareness 
of ideals, or something similar. 

There is in fact an alternative to the concept of ideal, namely, the concept 
of divisor developed by Kronecker. For the full story, see Edwards (1980). Kro- 
necker’s work overlaps with Dedekind’s and in fact his major paper, Kronecker 
(1882), appears in the same issue of the journal containing the Dedekind-Weber 
paper (probably not by accident,⁶ since Kronecker was an editor of the journal).
Kronecker’s paper surpasses the Dedekind-Weber paper in some ways, by consider- 
ing functions of several variables, but it falls short on specifics and applications, such 
as proofs of the Abel and Riemann-Roch theorems. For this reason, and its greater 
readability, the Dedekind-Weber paper has been far more influential. Nonetheless, 
we must give credit to Kronecker for his divisor concept. The word “divisor” is 
the one that has passed into algebraic geometry today, in place of what Dedekind 
and Weber called a “polygon.” In the next section we explain how the concept of 
divisibility made its way from algebraic number theory to algebraic geometry. 
Dedekind and Weber propose to give algebraic proofs of all of Riemann’s
algebraic theorems. But their remarkable originality (which
in all the history of algebraic geometry is only scarcely surpassed by 
that of Riemann) leads them to introduce a series of ideas that will 
become fundamental in the modern era. 


Dieudonné (1985), p. 29. 


If we had to explain the nature of algebraic number theory in a nutshell, we 
could say that it is the result of generalizing the theory of $\mathbb{Z} \subset \mathbb{Q}$ (the “rational
integers”) to the integers in an extension field $\mathbb{Q}(\alpha)$ of $\mathbb{Q}$, where $\alpha$ is an algebraic
number. In particular, one tries to generalize the theory of divisibility and primes, 
with a view to retaining, or recovering, unique prime factorization. The search for 
suitable “primes” leads to the discovery of the key concept of ideal. 

The 19th-century theory of algebraic functions begins with the analogous theory 
of polynomials in the field $\mathbb{C}(x)$ of rational functions of $x$ with complex coefficients,
and it similarly studies extensions of C(x). So first we should see to what extent 
the polynomials behave like “integers” in the field of rational functions. 


⁶ Indeed, Kronecker delayed publication of the Dedekind-Weber paper for over a year, until
his paper could appear at the same time. See Edwards (1980), p. 370.
Certainly, the polynomials form a ring. Also, it has been known since Stevin
(1585) that they have a division property and hence they admit a Euclidean algorithm.
The division property takes the following form: if $f(x)$ and $g(x) \ne 0$ are
polynomials, then there are polynomials $q(x)$ and $r(x)$ (“quotient” and “remainder”)
such that


$$f(x) = g(x)q(x) + r(x), \quad\text{where } \deg(r) < \deg(g).$$


Thus the remainder $r(x)$ is “smaller” than $g(x)$ when measured by its degree. The
Euclidean algorithm therefore terminates, and yields the gcd of polynomials $f(x)$
and $g(x)$, in a number of steps bounded by the degree of $g$. It follows by the
usual steps that unique prime factorization holds for the ring $\mathbb{C}[x]$ of polynomials
in $\mathbb{C}(x)$. To be precise, prime factorization is unique up to nonzero constant factors.
Moreover, we can say exactly what the prime polynomials are: by the fundamental
theorem of algebra, they are the linear polynomials $x - c$, for complex numbers $c$.
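
The division property and the resulting Euclidean algorithm are short enough to write out. The following Python sketch is an illustration under stated assumptions: coefficients are rational numbers (Python's `Fraction`) rather than complex numbers, so that the arithmetic stays exact, and a polynomial is a list of coefficients with the constant term first. The function names are hypothetical.

```python
from fractions import Fraction


def trim(p):
    """Drop zero coefficients from the high-degree end of the list."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p


def divmod_poly(f, g):
    """Return (q, r) with f = g*q + r and deg(r) < deg(g); g must be nonzero."""
    f, g = trim(f), trim(g)
    if not g:
        raise ZeroDivisionError("division by the zero polynomial")
    q = [Fraction(0)] * max(len(f) - len(g) + 1, 1)
    r = [Fraction(c) for c in f]
    while len(r) >= len(g):
        shift = len(r) - len(g)
        factor = r[-1] / Fraction(g[-1])
        q[shift] = factor
        # Subtract factor * x^shift * g, which cancels the leading term of r.
        for i, c in enumerate(g):
            r[i + shift] -= factor * c
        r = trim(r)
    return trim(q), r


def gcd_poly(f, g):
    """Euclidean algorithm; the result is scaled to have leading coefficient 1."""
    f, g = trim(f), trim(g)
    while g:
        _, r = divmod_poly(f, g)
        f, g = g, r
    return [Fraction(c) / Fraction(f[-1]) for c in f] if f else []


# x^3 - 1 divided by x^2 - 1: quotient x, remainder x - 1.
print(divmod_poly([-1, 0, 0, 1], [-1, 0, 1]))
# Their gcd is x - 1.
print(gcd_poly([-1, 0, 0, 1], [-1, 0, 1]))
```

Each pass through the loop in `divmod_poly` lowers the degree of the remainder, which is the termination argument given above in miniature.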

Now the field $\mathbb{C}(x)$ of rational functions is, as we saw in Section 4, the field
of meromorphic functions on the sphere $\mathbb{C} \cup \{\infty\}$. The primes $x - c$ of this field
correspond to the points $c$ of $\mathbb{C}$. So, in a sense, we can recover the points of the
surface on which the field $\mathbb{C}(x)$ “lives” from the primes of the field itself. (Admittedly,
we are still missing the point $\infty$, but presumably we can squeeze it out of
$\mathbb{C}(x)$ somehow.)

This seemingly retrograde idea—defining a Riemann surface from the functions 
on it—is one of the most original and prescient ideas of the Dedekind-Weber paper. 
But before we can explain how it works, we must say more about function fields, in 
order to show how a function field may be plausibly related to a Riemann surface 
in the first place. As mentioned in Section 4, the connection arises from Riemann’s 
discovery that the meromorphic functions on a Riemann surface are algebraic, that 
is, solutions of polynomial equations. It is also clear that the set of all meromorphic 
functions on a given Riemann surface forms a field, because it is obviously closed 
under the operations of $+$, $-$, $\times$, and $\div$ (by a nonzero function).

Now, if we pursue the idea of constructing algebraic function fields in analogy
with algebraic number fields, then we should extend $\mathbb{C}(x)$ to $\mathbb{C}(x)(y)$, where $y$ (a
“primitive element” for the field) satisfies an equation of the form


$$a_n y^n + a_{n-1} y^{n-1} + \cdots + a_1 y + a_0 = 0, \quad\text{where } a_0, a_1, \ldots, a_n \in \mathbb{C}(x).$$


The elements of $\mathbb{C}(x)$ are rational functions, that is, quotients of polynomials. By
multiplying by their common denominator we can therefore rewrite the equation
satisfied by $y$ in the form


$$f(x,y) = 0,$$

where $f(x,y)$ is a polynomial. In other words, $y$ is defined by the equation of an
algebraic curve, and hence of a Riemann surface. The function field generated by
$y$ consists of the rational functions in $y$,

$$\begin{aligned}
\mathbb{C}(x)(y) &= \{p(y)/q(y) : p, q \text{ polynomials with coefficients in } \mathbb{C}(x)\} \\
&= \{\text{rational functions in } x, y \text{ such that } f(x,y) = 0\} \\
&= \mathbb{C}(x,y)/(f(x,y)),
\end{aligned}$$


which is called the field of rational functions on the curve $f(x,y) = 0$, or on the
Riemann surface $f(x,y) = 0$. Thus a finite-degree extension of $\mathbb{C}(x)$ is naturally
viewed as the field of rational functions on a Riemann surface.
Exactly why the field of rational functions in $x, y$ such that $f(x,y) = 0$ should
be called “the rational functions on the curve $f(x,y) = 0$” is seldom explained.
One of the few authors who bothers to do so is Walker (1950), p. 132 (by “a point
of $f$” he means a point of the curve $f(x,y) = 0$):

If $g_1(x,y)/h_1(x,y)$ and $g_2(x,y)/h_2(x,y)$ are rational functions
and $(a,b)$ is any point of $f$ at which these functions have
values, then these values will be equal if $g_1(\xi,\eta)/h_1(\xi,\eta) =
g_2(\xi,\eta)/h_2(\xi,\eta)$. Conversely, if this last equality does not hold,
then $f(x,y)$ does not divide $g_1(x,y)h_2(x,y) - g_2(x,y)h_1(x,y)$,
and there are points of $f$ at which the two rational functions
assume different values. In other words, as far as the points of $f$
are concerned the rational functions behave like elements of $\Sigma$.


We now pause to look at some examples of finite-degree extensions of C(x). 
A genus 0 example. The rational functions on $x^2 + y^2 = 1$.


This curve is just the complex version of the circle, which is well known to have
a parameterization by rational functions:


$$x = \frac{2t}{1+t^2}, \qquad y = \frac{1-t^2}{1+t^2}.$$

(This result is equivalent to Euclid’s parameterization of Pythagorean triples, given 
in a lemma following Prop. 28 in Book X of the Elements.) 

A rational function of $x$ and $y$ is therefore a rational function of $t$, so our
function field in this case is a subfield of $\mathbb{C}(t)$. In fact our field is exactly $\mathbb{C}(t)$,
because

$$t = \frac{x}{1+y},$$

so any rational function of $t$ is a rational function of $x$ and $y$.

Thus, the field of rational functions on $x^2 + y^2 = 1$ is the same as the field of
rational functions on $\mathbb{C} \cup \{\infty\}$. If we hope to recover each Riemann surface from
the field of rational functions on it, the Riemann surface $x^2 + y^2 = 1$ therefore
needs to be the “same” as the sphere $\mathbb{C} \cup \{\infty\}$ in some sense. The equations above
show that $x^2 + y^2 = 1$ and $\mathbb{C} \cup \{\infty\}$ are indeed the “same” in the following strong
sense: they are birationally equivalent; that is, there is a one-to-one correspondence
$(x,y) \leftrightarrow t$ that is rational in both directions.
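
The two directions of this correspondence can be checked with exact arithmetic. The following Python sketch assumes the parameterization $x = 2t/(1+t^2)$, $y = (1-t^2)/(1+t^2)$ with inverse $t = x/(1+y)$, and tests it on rational values of $t$ only; the function names are hypothetical.

```python
from fractions import Fraction


def point_from_t(t):
    """Map the parameter t to the point (x, y) on x^2 + y^2 = 1."""
    t = Fraction(t)
    return 2 * t / (1 + t * t), (1 - t * t) / (1 + t * t)


def t_from_point(x, y):
    """Recover the parameter from a point of the circle with y != -1."""
    return x / (1 + y)


def round_trip_ok(t):
    """Check that t gives a point of the circle and that the point gives back t."""
    x, y = point_from_t(t)
    return x * x + y * y == 1 and t_from_point(x, y) == t


samples = [Fraction(n, 5) for n in range(-15, 16)]
print(all(round_trip_ok(t) for t in samples))
```

For rational $t$ the point $(0, -1)$ is never reached, so the division in `t_from_point` is always defined on these samples.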




This notion of equivalence was discovered by Riemann (1851), who showed a 
much stronger result, already alluded to in Section 4. Assuming the Dirichlet principle,
he showed that any Riemann surface of genus zero is birationally equivalent
to the sphere $\mathbb{C} \cup \{\infty\}$. A simpler fact about birational equivalence is that algebraic
curves $f(x,y) = 0$ and $g(x,y) = 0$ are birationally equivalent if and only if
their function fields are isomorphic, which is an easy generalization of the example 
above. 


A genus 1 example. The rational functions on $y^2 = 1 - x^4$.


The Riemann surface $y^2 = 1 - x^4$ is not birationally equivalent to the sphere,
because it has no parameterization $x = f(t)$, $y = g(t)$ by rational functions $f$
and g. This is the result foreseen by Jakob Bernoulli (1704) that we mentioned in 
Section 2. It is worth saying a little more about it now, because it nicely underlines 
the analogy between polynomials and integers. 
Suppose on the contrary that there are rational functions $x(t)$ and $y(t)$ with

$$y(t)^2 = 1 - x(t)^4.$$

Writing $x(t) = p(t)/r(t)$ and $y(t) = q(t)/r(t)$ as quotients of polynomials with a
common denominator we get the equation

$$q(t)^2 r(t)^2 = r(t)^4 - p(t)^4,$$

and hence

$$s(t)^2 = r(t)^4 - p(t)^4,$$

for certain polynomials $p(t)$, $r(t)$, and $s(t)$.

Now, as Jakob Bernoulli noticed, there is a theorem of Fermat that the equation 
$s^2 = r^4 - p^4$ is impossible for nonzero integers $p, r, s$. This does not rule out the
equation for nontrivial polynomials p,r,s, but Bernoulli was nevertheless on the 
right track. One has only to imitate Fermat’s argument, using polynomials in place 
of integers. The two domains are sufficiently similar that the arguments carry over. 

For example, the starting point of Fermat’s argument is essentially the fact
mentioned in the previous example: if $X, Y$ are rational numbers such that

$$X^2 + Y^2 = 1,$$

then we can write

$$X = \frac{2T}{1+T^2}, \quad Y = \frac{1-T^2}{1+T^2} \quad\text{for the rational number } T = \frac{X}{1+Y}.$$

In our case we have rational functions $X(t), Y(t)$ such that $X(t)^2 + Y(t)^2 = 1$, and
exactly the same calculation shows that

$$X(t) = \frac{2T(t)}{1+T(t)^2}, \quad Y(t) = \frac{1-T(t)^2}{1+T(t)^2} \quad\text{for the rational function } T(t) = \frac{X(t)}{1+Y(t)}.$$

The rest of the argument may be similarly rewritten, using divisibility of polynomials
in place of divisibility of numbers where appropriate.

Thus the polynomial version of Fermat’s argument shows that the Riemann 
surface $y^2 = 1 - x^4$ is not birationally equivalent to the sphere. It follows, by the
remark after the previous example, that the field of rational functions on $y^2 = 1 - x^4$
is not isomorphic to $\mathbb{C}(x)$.⁷

The function field of $y^2 = 1 - x^4$ therefore reflects the difference between this
genus 1 curve and the genus 0 curve $\mathbb{C} \cup \{\infty\}$. In fact, it reflects more than the
difference in genus, because the genus 1 curves actually fall into infinitely many 


⁷ The function field of $y^2 = 1 - x^4$ in fact equals $\mathbb{C}(\mathrm{sl}(t), \mathrm{sl}'(t))$, where $\mathrm{sl}$ is the lemniscatic
sine function defined, as in Section 2, by

$$\mathrm{sl}^{-1}(x) = \int_0^x \frac{dt}{\sqrt{1-t^4}}.$$

This is because the curve $y^2 = 1 - x^4$ has the parameterization

$$x = \mathrm{sl}(u), \qquad y = \mathrm{sl}'(u).$$

The definition of $\mathrm{sl}^{-1}$ of course gives $x = \mathrm{sl}(u)$, and differentiation gives

$$\frac{du}{dx} = \frac{1}{\sqrt{1-x^4}} = \frac{1}{y},$$

whence

$$\mathrm{sl}'(u) = \frac{dx}{du} = \sqrt{1-x^4} = y.$$

Thus the rational functions on $y^2 = 1 - x^4$ are generated from the elliptic functions $\mathrm{sl}$ and $\mathrm{sl}'$.
birational equivalence classes. This follows from a theorem of Salmon (1851), if one
suitably reinterprets Salmon’s result. He stated his result in terms of projective
equivalence, which happens to be the same as birational equivalence, and also as a
result about curves of degree 3 rather than curves of genus 1. The curve $y^2 = 1 - x^4$
is not itself of degree 3, of course, but it is birationally equivalent to the cubic curve

$$Y^2 = 4X^3 - 6X^2 + 4X - 1$$

under the substitution

$$X = \frac{1}{1-x}, \qquad Y = \frac{y}{(1-x)^2}.$$
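
The passage from the quartic to the cubic can be verified with exact arithmetic. The Python sketch below assumes the substitution $X = 1/(1-x)$, $Y = y/(1-x)^2$, which is the reading adopted here for the formula above. Since $y$ is usually irrational for rational $x$, the code works with $y^2 = 1 - x^4$ and $Y^2 = y^2/(1-x)^4$ instead of $y$ and $Y$; the function names are hypothetical.

```python
from fractions import Fraction


def to_cubic(x, y_squared):
    """Map (x, y^2) on y^2 = 1 - x^4 to (X, Y^2), assuming x != 1."""
    X = 1 / (1 - x)
    Y_squared = y_squared / (1 - x) ** 4
    return X, Y_squared


def cubic_rhs(X):
    """Right-hand side of Y^2 = 4X^3 - 6X^2 + 4X - 1."""
    return 4 * X**3 - 6 * X**2 + 4 * X - 1


def lands_on_cubic(x):
    """Check that the image of the point over x satisfies the cubic equation."""
    x = Fraction(x)
    X, Y_squared = to_cubic(x, 1 - x**4)
    return Y_squared == cubic_rhs(X)


# Sample rational x-values, all different from 1.
print(all(lands_on_cubic(Fraction(n, 7)) for n in range(-20, 7)))
```

The inverse direction is $x = 1 - 1/X$, $y = Y/X^2$, so the correspondence is rational both ways, as birational equivalence requires.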

The correspondence between birational equivalence classes of curves and iso- 
morphism classes of their function fields is now taken for granted—so much so that 
certain theorems that were discovered as results about birational equivalence are 
now stated (without comment) as theorems about function fields. A well-known 
example is the so-called Lüroth’s theorem. The original statement of the theorem,
in Lüroth (1875), is that any curve parameterized by rational functions can be
parameterized bijectively by rational functions (with the exception of finitely many
points, for example, if the curve has self-intersections). A typical modern statement
of the theorem is that any subfield of $\mathbb{C}(x)$ containing more than $\mathbb{C}$ is isomorphic
to $\mathbb{C}(x)$.

The equivalence of these two statements is not obvious, though not very hard 
to prove either. For an elementary proof that the modern form of the theorem 
implies the original form, see Shafarevich (1994), pp. 9-10. 
These prime ideals correspond to the linear factors in the theory of
polynomials. On this basis one attains a completely precise and gen- 
eral definition of a “point of a Riemann surface,” i.e., a complete 
system of numerical values that can be consistently attached to the 
functions of the field. 

Dedekind and Weber (1882), Introduction. 

The basic idea of Dedekind and Weber is very natural: a point a on a Riemann 
surface S gives a value f(a) to each rational function f on S, and the values given 
to different functions f and g are consistent in the sense that 

• The value given to a constant function $c$ is $c$.

• The value given to $f + g$ is $f(a) + g(a)$.

• The value given to $f - g$ is $f(a) - g(a)$.

• The value given to $f \times g$ is $f(a) \times g(a)$.

• The value given to $f \div g$ is $f(a) \div g(a)$.

When the values include $\infty$, as they necessarily do for rational functions, one has
to make conventions such as $1/\infty = 0$ and $1/0 = \infty$, but this is not a serious
problem. The basic question is: does each consistent assignment of values to the
rational functions on a surface $S$ arise from a unique point of $S$? If so, we can
reverse the original idea: namely, start with a function field $F$, and define points of
a surface $S_F$ for which $F$ is the field of rational functions. Just say: a point $P \in S_F$
is an assignment of values (from $\mathbb{C} \cup \{\infty\}$) to the functions in $F$ that satisfies the
consistency conditions above.
A genus 0 example. The rational function field $\mathbb{C}(x)$.


The idea that each point comes uniquely from an assignment of values to functions
is certainly valid when the function field is $\mathbb{C}(x)$. In this case a consistent
assignment of values to the functions in $\mathbb{C}(x)$ is determined by the values assigned
to the prime functions $x - c$, where $c \in \mathbb{C}$. To be consistent with subtraction and
the values of constant functions, each function $x - c$ must be given the value $a - c$,
for some fixed $a \in \mathbb{C} \cup \{\infty\}$. But this is exactly the assignment given by the point
$x = a$. Thus, for the function field of the surface $S = \mathbb{C} \cup \{\infty\}$, each consistent
assignment of values arises from a point $a \in S$.


We can also see that any point $P'$ of the surface $S$ for an algebraic function field
$F$ lies “over” a point of the sphere $\mathbb{C} \cup \{\infty\}$. Why? Because $F$ is a field containing
$\mathbb{C}(x)$ and $P'$ is a function $F \to \mathbb{C} \cup \{\infty\}$. Thus, if $P$ is simply the restriction of
the map $P'$ to the subfield $\mathbb{C}(x)$, then $P$ is a point of the surface for $\mathbb{C}(x)$, which
corresponds to a point of $\mathbb{C} \cup \{\infty\}$, as just explained.

Moreover, if $F$ is a field of degree $n$, then there are in general $n$ points $P'$
over a given point $P$ (so the surface is an “$n$-sheeted covering” of the sphere). If $P$
corresponds to the value $x = x_0$, consider a primitive element $y \in F$, which satisfies
an equation⁸

$$y^n + a_{n-1}(x) y^{n-1} + \cdots + a_1(x) y + a_0(x) = 0,$$

where $a_0(x), \ldots, a_{n-1}(x) \in \mathbb{C}(x)$. The $n$ points over $P$ are the $n$ assignments of
values to the functions in $F$ arising from the $n$ values of $y$ satisfying the equation


$$y^n + a_{n-1}(x_0) y^{n-1} + \cdots + a_1(x_0) y + a_0(x_0) = 0.$$
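
To see the $n$ values in a simple case, take the curve $y^2 = 1 - x^4$ of the genus 1 example, for which $n = 2$ and the equation over $x = x_0$ reads $y^2 + (x_0^4 - 1) = 0$. The following Python sketch, an illustration with a hypothetical function name, lists the values of $y$ over a finite $x_0$ using floating-point complex arithmetic; it does not treat the points over $x = \infty$.

```python
import cmath


def points_over(x0):
    """Values of y with y^2 = 1 - x0^4, i.e. the points of the curve over x = x0."""
    y = cmath.sqrt(1 - x0**4)
    # The two square roots coincide exactly when 1 - x0^4 = 0.
    return {y, -y}


for x0 in (0, 2, 1):
    print(x0, len(points_over(x0)))
```

Over $x_0 = 0$ and $x_0 = 2$ there are two values of $y$, the general situation for a 2-sheeted covering, while over $x_0 = 1$ the two values merge into the single value $y = 0$, which is the kind of exceptional point one expects to be a branch point.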


Thus the surface defined by Dedekind and Weber makes a good start in capturing 
the properties of a Riemann surface, and we can imagine identifying branch points, 
and so on. But where do ideals come in? 


When we extend $\mathbb{C}(x)$ to an algebraic function field $F$, the analogy with algebraic
number fields suggests that we should first define the “integers” of $F$, then the
ideals. Nowadays, one generally skips the “integers” and goes straight to ideals,
and beyond them to “divisors.” We come to divisors shortly, but we will follow
Dedekind and Weber by defining the “integers” first. The “integers” of $\mathbb{C}(x)$ are of
course the polynomials in $x$, so the ring $\mathfrak{o}_F$ of “integers” of $F$ should be the integral
closure of the ring $\mathbb{C}[x]$ of polynomials with complex coefficients. This ring consists
of the functions $y \in F$ satisfying equations of the form


$$y^n + b_{n-1} y^{n-1} + \cdots + b_1 y + b_0 = 0,$$


where $b_0, \ldots, b_{n-1} \in \mathbb{C}[x]$. Dedekind and Weber call these ganz Functionen, which
we will translate as integral algebraic functions. They show in §3 that any element
of $F$ is a quotient of elements of $\mathfrak{o}_F$, in fact the quotient of an element of $\mathfrak{o}_F$ by a
polynomial.

Ideals in $\mathfrak{o}_F$ are then defined as in any ring, by the closure properties given in
Section 7. A prime ideal $\mathfrak{p}$ is divisible only by itself and $(1) = \mathfrak{o}_F$, so (since “to
divide is to contain”) $\mathfrak{p}$ is maximal. That is, if $y$ is any integral algebraic function
in $\mathfrak{o}_F \setminus \mathfrak{p}$, the only ideal containing both $y$ and $\mathfrak{p}$ is $\mathfrak{o}_F$ itself.


⁸ Incidentally, assuming this is the minimal polynomial for $y$, the rational function $a_0(x)$ is,
up to sign, the norm of $y$. As in algebraic number theory, we can use properties of the norm to deduce
properties of $y$. For example, since the total order of zeros and poles of a rational function is 0,
the same is true of the total order of zeros and poles of $y$.
Having observed that the points of $\mathbb{C} \cup \{\infty\}$ correspond to the primes of $\mathbb{C}(x)$,
we may expect the points of $S_F$ to correspond to “primes” of $F$. This is where
ideals come in: the points of $S_F$ not at $\infty$ correspond to the prime ideals of the ring
$\mathfrak{o}_F$ of integral algebraic functions of $F$.

Namely, for each point $P \in S_F$ not at $\infty$, let


$$\mathfrak{p}_P = \{f \in \mathfrak{o}_F : f(P) = 0\}.$$


(Notice that $\mathfrak{p}_\infty$ is empty because $f(\infty) = 0$ implies $f \notin \mathfrak{o}_F$.) It is clear that $\mathfrak{p}_P$ is
an ideal, because it is closed under sums and differences, and under product with
arbitrary integral algebraic functions. And $\mathfrak{p}_P$ is maximal, and hence prime, because if we
adjoin to $\mathfrak{p}_P$ any integral algebraic function $g$ not in $\mathfrak{p}_P$ the resulting ideal is the
whole of $\mathfrak{o}_F$. This is because $g(P) \ne 0$, since $g \notin \mathfrak{p}_P$, but obviously $g - g(P) \in \mathfrak{p}_P$.
An ideal containing both $g$ and $g - g(P)$ contains the constant function with value
$g(P)$, so it is the whole of $\mathfrak{o}_F$. Thus $\mathfrak{p}_P$ is maximal and hence prime.

Conversely, each prime ideal $\mathfrak{p}$ in $\mathfrak{o}_F$ equals $\mathfrak{p}_P$ for some point $P \ne \infty$.

To see why, let $\mathfrak{p}$ be a prime ideal, and assign the value 0 to each $f \in \mathfrak{p}$.
By the closure properties of ideals, this is a consistent assignment of values to all
$f \in \mathfrak{p}$. Since $\mathfrak{p}$ is prime, and hence maximal, we can extend this assignment of
values consistently to all of $\mathfrak{o}_F$ by adjoining a constant function with nonzero value
$c$. The assignment then extends consistently to all functions in $F$ by writing them
as quotients of integral algebraic functions, and hence it defines a point $P \ne \infty$
(because, as we observed above, functions $f$ with $f(\infty) = 0$ are not integral).
Finally, since we assigned the value 0 to all $f \in \mathfrak{p}$, we have


$$\mathfrak{p} = \{f \in \mathfrak{o}_F : f(P) = 0\} = \mathfrak{p}_P.$$


So far, so good. However, the integral algebraic functions and their prime ideals 
do not tell the whole story of functions on a Riemann surface. The problem lies 
in the presence of the point $\infty$ on the Riemann sphere. As we saw in Section 4,
the compactness of $\mathbb{C} \cup \{\infty\}$ is crucial in obtaining a nice field of meromorphic
functions (the rational function field $\mathbb{C}(x)$). But the presence of $\infty$ means that the
function $w = 1/x$ is just as good as the function $x$. It is therefore just as good to
view the members of $\mathbb{C}(x)$ as functions of $1/x$ as it is to view them as functions of
$x$. So the “integral functions” (the polynomials in $x$) do not have a privileged role
in the theory of rational functions.

Indeed, we have already seen that zeros and poles should be treated equally 
for rational functions, and the same applies to algebraic functions in general. We 
therefore need a concept that captures the possible zeros and poles of an arbitrary 
algebraic function, and this is the concept now called a divisor. 

A divisor $D$ on a Riemann surface $S$ can be written in the (multiplicative) form


$$D = P_1^{m_1} P_2^{m_2} \cdots P_k^{m_k},$$


where $P_1, \ldots, P_k$ are points of $S$ and $m_1, \ldots, m_k$ are nonzero integers. The idea is
that if $m_i > 0$ then $P_i^{m_i}$ encodes a zero of order $m_i$ at $P_i$ of some function $f$. In
other words, in a neighborhood of $P_i$, $f(x) = (x - P_i)^{m_i} E$, where $E$ is approximately
constant and nonzero. When $m_i < 0$, $P_i$ is a similarly defined pole of order $-m_i$.
For the sake of uniformity, we simply say that $P_i$ is a point of order $m_i$, so points
of positive order are zeros and points of negative order are poles.

A divisor is called principal (analogous to a principal ideal) if it is realized by
some function $f$, in the sense that $f$ is of order $m_i$ at each point $P_i$, and of order zero
everywhere else. For example we saw in Section 4 that each divisor $P_1^{m_1} P_2^{m_2} \cdots P_k^{m_k}$
on $\mathbb{C} \cup \{\infty\}$ is realized by a rational function, provided that $m_1 + \cdots + m_k = 0$
(and that we need some $P_i = \infty$ if the remaining $m_i$ have a positive sum).

For other Riemann surfaces, as we also know from Section 4, there are further 
constraints on zeros and poles, and it is more difficult to decide which divisors are 
principal. We therefore begin by studying a weaker relationship between functions 
and divisors, called divisibility (which helps explain why they are called “divisors”).
We say that $P_1^{m_1} P_2^{m_2} \cdots P_k^{m_k}$ divides a function $f$ if $f$ has order at least $m_i$ at point
$P_i$, and order zero at all other points. It follows, for example, that if $P_i$ divides
a product $fg$ then $P_i$ divides $f$ or $P_i$ divides $g$, so the points behave like prime
divisors. 

Dedekind and Weber use the word “polygon” instead of “divisor,” and they al- 
low only positive m;. Such divisors are now called “effective” or positive. However, 
Dedekind and Weber allow quotients of their “polygons,” so they essentially use all 
divisors. Also, they realize a divisor D much as they realize an ideal, by a set of 
actual algebraic functions—namely, by the set of all f in the given function field 
divisible by $D$. Then they can define divisibility of divisors exactly as for ideals: “to
divide is to contain.” It is also not hard to recognize divisibility from the symbols
for divisors. For example, $P_1$ divides $P_1^2$ and $P_1/P_2^2$ divides $P_1/P_2$. In general, a
divisor $\mathfrak{a}$ divides divisor $\mathfrak{b}$ if the points in $\mathfrak{b}\mathfrak{a}^{-1}$ occur to powers $> 0$. With these
concepts in place, one can set out on the long road to the theorems of Abel and 
Riemann-Roch discussed in Sections 1 and 4. 
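
The symbolic rules for divisors are easy to mechanize. In the following Python sketch, which is an illustration and not Dedekind and Weber's construction, a divisor is a dictionary from point labels to nonzero integer exponents; the labels `"P1"`, `"P2"`, `"inf"` and the function names are hypothetical. Divisibility is tested by the rule just stated: $\mathfrak{a}$ divides $\mathfrak{b}$ if every point of $\mathfrak{b}\mathfrak{a}^{-1}$ occurs to a positive power.

```python
def multiply(d1, d2):
    """Product of divisors: exponents add, and points of exponent 0 are dropped."""
    result = dict(d1)
    for point, m in d2.items():
        result[point] = result.get(point, 0) + m
    return {p: m for p, m in result.items() if m != 0}


def inverse(d):
    """Inverse divisor: every exponent changes sign."""
    return {p: -m for p, m in d.items()}


def divides(a, b):
    """a divides b if all points of b * a^(-1) occur to powers > 0."""
    return all(m > 0 for m in multiply(b, inverse(a)).values())


def total_order(d):
    """Sum of the exponents (zeros counted positively, poles negatively)."""
    return sum(d.values())


print(divides({"P1": 1}, {"P1": 2}))                      # P1 against P1^2
print(divides({"P1": 1, "P2": -2}, {"P1": 1, "P2": -1}))  # P1/P2^2 against P1/P2
print(divides({"P1": 1, "P2": -1}, {"P1": 1, "P2": -2}))  # the reverse direction

# Zeros and poles of x/(x - 1)^2 on the sphere: a simple zero at 0,
# a double pole at 1, and a simple zero at infinity.
print(total_order({0: 1, 1: -2, "inf": 1}))
```

The last line reflects the condition $m_1 + \cdots + m_k = 0$ for a divisor on the sphere to be realized by a rational function.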


11. From Points to Valuations 


The sections §14 and §15 of Dedekind-Weber can be read today as a 
first introduction to valuation theory—but a general valuation theory 
in arbitrary fields was first launched in 1913 by Kürschák, prompted
by the p-adic ideas of Hensel and without reference to places [that is, 
points]. The connection between places, valuations, and integral clo- 
sure present in Dedekind and Weber in the first phase of development 
was lost in the general valuation theory. 

Geyer (1981), p. 120 (my translation). 

For each point $P$ of the Riemann surface $S_F$, Dedekind and Weber define what
they call the order numbers at $P$ for each function $f \in F$ that either vanishes or
becomes infinite at $P$. Relative to a variable $z$ that is finite and nonzero at $P$, $f$
has order $r$ at $P$ if $f(z)/z^r$ is nonzero at $P$ but $f(z)/z^{r-1}$ is not. Likewise, $f(z)$
has order $-r$ at $P$ if $f(1/z)$ has order $r$; that is, if $f(z)$ “goes to infinity like $z^r$.”

Thus each point $P \in S_F$ defines what we now call a discrete valuation on $F$: a
mapping $v_P$ from $F$ into the integers with the properties that


$$v_P(fg) = v_P(f) + v_P(g),$$
$$v_P(f + g) \ge \min(v_P(f), v_P(g)) \quad\text{if } f + g \ne 0.$$


Since points $P \ne \infty$ correspond to prime ideals $\mathfrak{p}$, the valuation $v_P$ may also be
associated with $\mathfrak{p}$, and indeed


$$v_P(f) = \text{exponent of } \mathfrak{p} \text{ in the prime ideal factorization of } f.$$


When viewed in this way, the concept of valuation is unhitched from the existence
of “points,” and it applies wherever unique prime ideal factorization holds.
Indeed, with hindsight it becomes apparent that a discrete valuation is behind
Kummer’s concept of “ideal numbers.” Kummer does not define his ideal prime
numbers $\mathfrak{p}$; in effect, he defines a discrete valuation $v$ on the cyclotomic integers $\alpha$
such that


$$v(\alpha) = \text{exponent of } \mathfrak{p} \text{ in the prime ideal factorization of } \alpha.$$


Even for the field of rational numbers one has an interesting valuation associ- 
ated with each prime p: 


v(r) = exponent of p in the prime factorization of r. 


This valuation is called the p-adic valuation, and it was introduced by Hensel (1897) 
to define the p-adic numbers, later found to be a powerful tool in algebraic number 
theory. It seems unlikely that the p-adic valuation would have been considered 
promising if not for the success of valuations in the theory of algebraic function 
fields. So here we see influence flowing in the opposite direction: from the theory of 
function fields back to its origin and model, the theory of algebraic number fields. 
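
The $p$-adic valuation on the rational numbers is simple to compute, and the two defining properties of a discrete valuation can be tested directly. The following Python sketch is an illustration with hypothetical function names; it leaves the valuation of 0 undefined, as an assumption of the sketch, and checks the properties only on a finite sample.

```python
from fractions import Fraction


def valuation(r, p):
    """Exponent of the prime p in the factorization of the nonzero rational r."""
    r = Fraction(r)
    if r == 0:
        raise ValueError("the valuation of 0 is not defined in this sketch")
    v = 0
    num, den = r.numerator, r.denominator
    while num % p == 0:   # powers of p in the numerator count positively
        num //= p
        v += 1
    while den % p == 0:   # powers of p in the denominator count negatively
        den //= p
        v -= 1
    return v


def check_valuation_properties(p, samples):
    """Test v(rs) = v(r) + v(s) and v(r + s) >= min(v(r), v(s)) on the samples."""
    for r in samples:
        for s in samples:
            if valuation(r * s, p) != valuation(r, p) + valuation(s, p):
                return False
            if r + s != 0 and valuation(r + s, p) < min(valuation(r, p), valuation(s, p)):
                return False
    return True


samples = [Fraction(a, b) for a in range(-6, 7) if a != 0 for b in range(1, 7)]
print(valuation(Fraction(12, 25), 2), valuation(Fraction(12, 25), 5))
print(check_valuation_properties(3, samples))
```

Here $12/25 = 2^2 \cdot 3 \cdot 5^{-2}$, so the 2-adic valuation is 2 and the 5-adic valuation is $-2$.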

In a reversal of the historical development (which is common, but alas seldom 
mentioned), accounts of the theory of algebraic numbers and functions today often 
begin with the concept of valuation. They then proceed to define “points” (or 
places, as they are now called), ramification, genus, and so on. See for example 
Artin (1951) or Cohn (1991). 


12. Reading the Dedekind-Weber Paper 


The influence of Dedekind on the paper is obvious, since it follows his approach 
to algebraic number theory, almost to a fault. As in Dedekind (1877), the ground- 
work for ideal theory is laid with a thorough study of modules and their divisibility 
theory, with the help of concepts such as basis, norm, and discriminant. However, 
in contrast to Dedekind (1877), the present paper is quite formal, unmotivated, and 
lacking in historical perspective. This may be because Dedekind was busy working 
on the third edition of Dirichlet’s Vorlesungen über Zahlentheorie at the time, so he
entrusted the final writeup of the paper to Weber. In the foreword to that edition 
(dated 11 November 1880) he placed his supplement on ideal theory in a context 
that included his work with Weber, as follows: 


We mention in particular the extended presentation of ideal the- 
ory, in the last supplement, which I first published in the second 
edition, but in such terse form that the wish for a more detailed 
version was expressed to me. I have been glad to return to this 
request since working with my friend H. Weber of Königsberg
on an extended investigation, still to appear, whose import is 
that the same principles may be successfully carried over to the 
theory of algebraic functions. 


In a letter to Weber (30 October 1880, on p. 488 of Dedekind’s Werke, volume III) 
Dedekind wrote: 


I take this opportunity to express my deepest thanks to 
you for this work of almost two years, which has given you an 
infinite amount of trouble, but which has given me the greatest 
joy and has made a significant scientific advance. It is a beautiful 
feeling to find a new truth, as Pascal in his first letter to Fermat
expressed so well: Car je voudrais désormais vous ouvrir mon
cœur, s’il se pouvait, tant j’ai de joie de voir notre rencontre.
Je vois bien que la vérité est la même à Toulouse et à Paris.⁹ I
often think back over the progress of our work which, after many 
oscillations, has acquired more and more the character of inner 
necessity ... 


Another sign of Weber’s hand in the paper is that when Weber (1908) returned 
to the subject in the third volume of his Algebra, §§170—201, he gave a very sim- 
ilar treatment—even using the same notation—but with a few extra explanatory 
remarks. 

For whatever reason, the Dedekind-Weber paper is not such easy reading as 
Dedekind (1877). In an attempt to make the reading easier, I have taken the liberty
of inserting a short “Summary and Comments” at the beginning of each section 
(marked by the § symbol). In these insertions I try to summarize the main concepts 
and results of the section, with comments on where they come from and where they 
are going. I have sometimes drawn on the very useful commentary of Geyer (1981) 
in Dedekind et al. (1981), and on the account of Dedekind and Weber’s theory
in Koch (1991), Chapter 23. It may also be worthwhile to consult the first chapter 
of Eichler (1966), which extracts the linear algebraic core of Dedekind and Weber’s 
proof of the Riemann-Roch theorem with remarkable concision. (Eichler’s book is 
also interesting as a modernization of Weber (1908), which is a synthesis of ideas 
from both Dedekind and Kronecker.) 

However, as I remarked in the Preface, all commentators on the Dedekind- 
Weber paper seem to attend to only some of the ideas it contains, while ignoring 
others. No doubt my own commentary fails to highlight certain ideas as much as 
it should, but I hope that the zone of obscurity has at least been reduced, so that 
any unnoticed gems can be more easily found. Ultimately, of course, the only way 
to know exactly what Dedekind and Weber are saying is to read their own words. 

With only a few exceptions, I have retained Dedekind and Weber’s terminology 
and notation, in order to faithfully represent their train of thought. For example, 
it is necessary to retain their awkward term “polygon” for what we now call a 
“divisor.” At the time of writing their paper, Dedekind and Weber were prepared 
to view numbers, functions, ideals, and modules as “divisors,” but “polygons” were 
a new (and more geometric) idea, which they understandably wanted to highlight 
with a different, and geometric, word. 

On the other hand, I think it unnecessarily literal (and tedious) to translate 
“ganze rationale Funktion” as “entire rational function” when “polynomial” better
conveys the idea to the modern reader. Similarly, the symbol S for trace (“Spur” in 
German) has been rewritten as Tr, since this does not clash with the other notation 
and it carries more meaning today. The only other change in notation, which occurs 
only in §10, has been to use $\delta_{rs}$ for the Kronecker delta in place of Dedekind and
Weber’s ad hoc notation (r,s). Since the notation ( , ) now has many meanings
(and it already did in the Dedekind-Weber paper) it can only be confusing to retain 
it when an unambiguous replacement is now available. 


⁹ For I now want to open my heart to you, if I may, because I am so overjoyed by our
agreement. I see that the truth is the same in Toulouse as in Paris.

As mentioned in Section 5, Dedekind and Weber use the term Schaar for what
we now call a vector space, but without establishing the basic vector space proper- 
ties once and for all. I have decided to translate the word Schaar as “vector space,” 
since this seems to be the least confusing option, but readers should be prepared 
for repeated statements of vector space properties that we now take for granted. 

In any case, I have footnoted the places where deviations in terminology or 
notation occur, so readers are in a position to reconstruct the original if they so 
desire. 


13. Conclusion 


The aim of this Introduction has been to expose the historical roots of the 
theory of algebraic functions, and to give a sense of the mathematical climate at 
the time when Dedekind and Weber were writing their paper. 

Dedekind and Weber (1882) arrived at a time of great ferment in the theory 
of algebraic functions, with Kronecker (1882) being published in the same journal, 
and the works of Klein (1882) and Poincaré (1882) published in the same year. 
Klein and Poincaré championed Riemann’s methods and their inspiration from 
physics—an unpopular view at a time when the goal was to “arithmetize” analysis, 
under the influence of Weierstrass and Kronecker. The aim of arithmetization was 
to avoid appeal to geometric and physical intuition by reducing the concepts of 
analysis to properties of natural numbers and sets of natural numbers. At the 
time, arithmetization prevailed, and it still does, inasmuch as Kronecker’s term 
“divisor” is now used in all books on Riemann surfaces. However, the analytic 
approach made something of a comeback after Riemann’s ideas were vindicated by 
Weyl (1913), and today it is likewise true that most books on Riemann surfaces 
make heavy use of analysis. To see the arithmetic/algebraic approach in its greatest 
purity, the paper of Dedekind and Weber is probably still the best place to look. 
The Dedekind-Weber arithmetization was not really the future of analysis, but it 
was the future of algebraic geometry. 


John Stillwell 
South Melbourne, 1 May 2012


Introduction


SUMMARY AND COMMENTS 


In their introduction, Dedekind and Weber emphasize their two 
great innovations: the arithmetic theory of algebraic function fields, 
based on Dedekind’s theory of algebraic number fields, and the 
algebraic definition of Riemann surfaces via “points,” each of which 
is a consistent assignment of values to the functions of the field. 
They do not attempt to extract the common essence of the theories 
of function fields and number fields, which was first found by Emmy 
Noether in 1927 in the theory of Dedekind rings. 

At the end of the introduction they express the hope of return- 
ing to the subject “on another occasion,” in order to deal with the 
question of Abelian integrals and their periods. In fact, they did not 
return to the subject, perhaps because such questions are beyond 
the scope of their methods. In giving the first rigorous proofs of the 
Abel and Riemann-Roch theorems, they had done quite enough. 


The purpose of the following investigations is to construct the theory of al- 
gebraic functions of one variable, which is one of Riemann’s great creations, on 
the basis of a simple, yet at the same time rigorous and completely general view- 
point. In previous investigations of this topic certain restrictive assumptions about 
the singularities of the functions under consideration have been made, as a rule, 
and the so-called exceptional cases have either been mentioned casually as limit- 
ing cases, or else left aside entirely. Likewise, certain basic theorems on continuity 
and developability have been assumed, on the evidence of geometric intuition. A 
sounder basis for the fundamental concepts, as well as a general treatment of the 
theory, without exceptions, is obtained when one proceeds from a generalization 
of the theory of rational functions of one variable, in particular, from the theorem 
that each rational function of one variable admits a decomposition into linear fac- 
tors. This generalization is simple and well known in the first case, in which the 
number Riemann denoted by p (called the genus by Clebsch) has the value zero. 
For the general case, which is related to the one just mentioned in the same way 
that general algebraic numbers are related to rationals, the way forward is found 
by carrying over to functions those methods that are most successful in the theory 
of numbers, including Kummer’s creation of ideal numbers.¹⁰


¹⁰ Kummer first introduced ideal numbers in the work Zur Theorie der complexen Zahlen
(Crelle’s Journal, v. 35). A further development and general presentation of the theory of
algebraic numbers may be found in the second and third editions of Dirichlet’s Vorlesungen über
Zahlentheorie, as well as in the work of Dedekind: Sur la théorie des nombres entiers algébriques

If, in analogy with number theory, a field of algebraic functions is understood
to be a system of such functions with the property that application of the four 
arithmetic operations to functions of the system always leads to functions of the 
same system, then this concept coincides with that of the Riemann class of algebraic 
functions. Any function in such a field can be regarded as the independent variable 
and the rest as dependent on it. Each such “mode of representation” yields a system 
of functions of the field called integral algebraic functions, the quotients of which 
exhaust the whole field. Among these integral algebraic functions another group 
of functions may be distinguished, with the characteristic of polynomials that they 
admit a greatest common divisor. Such a divisor does not exist in general; however, 
when theorems on rational functions are related not to the divisor itself, but to the 
functions divisible by it, they carry over completely to general algebraic functions. 
In this way one arrives at the concept of ideal, a name stemming from Kummer’s 
work in number theory, where nonexistent divisors are replaced by “ideal divisors.” 

While the present work by no means involves “ideal functions”, but only sys- 
tems of actually existing functions, it seems useful to retain the name “ideal”, which 
is already customary in number theory. 

With a suitable definition of multiplication it is possible to calculate with ideals 
using the same rules as for rational functions. In particular, one has the theorem 
that each ideal decomposes uniquely into factors that are themselves indecompos- 
able, and hence may be called prime ideals. These prime ideals correspond to the 
linear factors in the theory of polynomials. On this basis one attains a completely 
precise and general definition of a “point of a Riemann surface”, i.e., a complete 
system of numerical values that can be consistently attached to the functions of the 
field. 

This in turn yields a formal definition of the differential quotient, the genus, and 
a quite general, elegant presentation of differentials of the first kind. Finally there 
is the proof of the Riemann-Roch theorem on the number of arbitrary constants in 
functions defined by their poles, and the theory of differentials of the second and 
third kinds. Up to this point the continuity and developability of our functions 
do not come into consideration. It would be completely valid, e.g., to restrict the 
domain of numbers to algebraic numbers. In this way a well-defined and apparently 
broader theory of algebraic functions can be treated by methods belonging to its 
own sphere. 

Admittedly all these results come from Riemann’s theory using a far narrower 
range of methods, and as special cases of something more general. However, it is 
known that a rigorous foundation of Riemann’s theory presents certain difficulties 
and, until these have been completely overcome, it may be that the path we have 
taken, or at least something similar, is the only one leading to the goal with sat- 
isfactory rigor and generality. The theory of ideals itself would be much simpler if 
one could assume the intuitive concept of a Riemann surface and particularly that 
of its points, together with the continuity of algebraic functions. Our work, on the 
other hand, takes a long algebraic detour through the theory of ideals, leading to a 
completely precise and rigorous definition of a “point of a Riemann surface” that
can also serve as a basis for the investigation of continuity and related questions.
The latter questions, which include those on Abelian integrals and their periods,
are left aside for the present. We hope to come back to them on another occasion.


(Paris 1877. Reprinted from the Bulletin des Sciences math. et astron. of Darboux and Hoüel).
However, knowledge of these publications is not assumed in the present work.

We have learned from verbal communications that for some years Kronecker has been carrying
out investigations, related to the work of Weierstrass, based on the same foundations as ours.


Königsberg, 22 October 1880.


§1. Fields of algebraic functions


SUMMARY AND COMMENTS 


In §§1, 2, and 3 of their paper, Dedekind and Weber describe
the general framework for their study of algebraic functions: a finite- 
degree extension of the field C(z) of rational functions of one vari- 
able. The basic concepts are modeled on the analogous concepts 
in Dedekind’s (1877) theory of algebraic integers, where the frame- 
work is a finite-degree extension of the field of rational numbers. 

Just as an algebraic number is the solution of a polynomial 
equation with rational number coefficients, an algebraic function 
is the solution of a polynomial equation with rational function co- 
efficients. The degree of a function is the minimal degree of the 
equations that it satisfies. As in the number case, there is a proof 
that sum, difference, product, and quotient of algebraic functions 
are algebraic. Indeed, if $\theta$ is a function of degree n, then the linear
combinations of $1, \theta, \ldots, \theta^{n-1}$ with rational function coefficients
form a field of degree n, with basis $\{1, \theta, \ldots, \theta^{n-1}\}$. Such algebraic
function fields $\Omega$ are the basic object of study in what follows.


A variable $\theta$ is called an algebraic function of an independent variable z when
it satisfies an irreducible algebraic equation

\[ F(\theta, z) = 0. \tag{1} \]

Here F denotes an expression of the form

\[ F(\theta, z) = a_0\theta^n + a_1\theta^{n-1} + \cdots + a_{n-1}\theta + a_n, \]

where the coefficients $a_0, a_1, \ldots, a_n$ are polynomial functions of z without common
divisor. The assumed irreducibility of equation (1) means that $\theta$ does not satisfy
an equation of lower degree in $\theta$ and, as follows from the algorithm for greatest
common divisor, if

\[ G(\theta, z) = b_0\theta^m + b_1\theta^{m-1} + \cdots + b_{m-1}\theta + b_m = 0 \]

is a second equation satisfied by $\theta$ then $G(\theta, z)$ must be divisible by $F(\theta, z)$. It may
now be proved that $G(\theta, z)$ also cannot be of lower degree in z than $F(\theta, z)$, and
it is of the same degree only if $G(\theta, z)$ differs from $F(\theta, z)$ by a factor independent
of z. If we assume that the coefficients $b_0, b_1, \ldots, b_m$ are freed of common factors,
and if we let

\[ H(\theta, z) = c_0\theta^{m-n} + c_1\theta^{m-n-1} + \cdots + c_{m-n} \]

denote the denominator-free quotient of G by F, then

\[ kG(\theta, z) = F(\theta, z) \cdot H(\theta, z), \]

where k is a polynomial function of z. Comparison of coefficients yields

\[
\begin{aligned}
kb_0 &= a_0c_0,\\
kb_1 &= a_0c_1 + a_1c_0,\\
kb_2 &= a_0c_2 + a_1c_1 + a_2c_0,\\
&\;\;\vdots
\end{aligned}
\]

where the $c_0, c_1, \ldots, c_{m-n}$ can likewise be assumed to have no common divisor.

It follows from this, first, that k must be constant, so that it can be set equal
to 1. Because if $a_0, a_1, \ldots, a_{r-1}, c_0, c_1, \ldots, c_{s-1}$ are divisible by a linear factor of k
and $a_r, c_s$ are not, then the equation

\[ kb_{r+s} = \cdots + a_{r-1}c_{s+1} + a_rc_s + a_{r+1}c_{s-1} + \cdots \]

yields the contradiction that $a_rc_s$ is divisible by the same linear factor. It then
follows that the degree of $G(\theta, z)$ in z is the sum of the degrees of F and H in z,
because if $a_r, c_s$ are the first among the coefficients a, c to have degrees of maximal
value then it again follows from

\[ b_{r+s} = \cdots + a_{r-1}c_{s+1} + a_rc_s + a_{r+1}c_{s-1} + \cdots \]

that the degree of $b_{r+s}$ equals the sum of the degrees of $a_r$ and $c_s$.

If one divides equation (1) by $a_0$ then it can also be written in the form

\[ f(\theta, z) = \theta^n + b_1\theta^{n-1} + \cdots + b_{n-1}\theta + b_n = 0, \tag{2} \]

where the coefficients $b_1, b_2, \ldots, b_n$ may be fractional rational functions of z.

The system $\Phi(\theta, z)$ of all rational functions of $\theta$ and z has the property of closure
under the elementary arithmetic operations of addition, subtraction, multiplication
and division, and this system is therefore called a field of algebraic functions of
degree n. If $\varphi(\theta)$ is a polynomial function of $\theta$ whose coefficients are rational in z,
then by algebraic division one can determine two functions $q(\theta), r(\theta)$, the second of
which has degree no greater than n - 1, such that

\[ \varphi(\theta) = q(\theta)f(\theta) + r(\theta). \]

Then, by (2),

\[ \varphi(\theta) = r(\theta). \]

If $\varphi(\theta)$ is not divisible by $f(\theta)$ then (by the assumed irreducibility of $f(\theta)$) these
two functions have no common divisor and hence, by the method of finding the
greatest common divisor, one obtains two functions $f_1(\theta), \varphi_1(\theta)$ such that

\[ f(\theta)f_1(\theta) + \varphi(\theta)\varphi_1(\theta) = 1, \]

and hence by (2)

\[ \frac{1}{\varphi(\theta)} = \varphi_1(\theta). \]
These remarks, along with the assumed irreducibility of $f(\theta)$, yield the following:

Theorem. Each function $\zeta$ in the field $\Omega$ is uniquely expressible in the form

\[ \zeta = x_0 + x_1\theta + \cdots + x_{n-1}\theta^{n-1}, \]

where the coefficients $x_0, x_1, \ldots, x_{n-1}$ are rational functions of z. Conversely, each
function of this form obviously belongs to the field $\Omega$.

If one chooses n functions arbitrarily from the field $\Omega$:

\[
\begin{aligned}
\eta_1 &= x_0^{(1)} + x_1^{(1)}\theta + \cdots + x_{n-1}^{(1)}\theta^{n-1},\\
\eta_2 &= x_0^{(2)} + x_1^{(2)}\theta + \cdots + x_{n-1}^{(2)}\theta^{n-1},\\
&\;\;\vdots\\
\eta_n &= x_0^{(n)} + x_1^{(n)}\theta + \cdots + x_{n-1}^{(n)}\theta^{n-1},
\end{aligned}
\]

but in such a way that the determinant¹¹

\[ \sum \pm x_0^{(1)} x_1^{(2)} \cdots x_{n-1}^{(n)} \]

is not identically zero, then it follows that every function in the field $\Omega$ is expressible
in the form

\[ \zeta = y_1\eta_1 + y_2\eta_2 + \cdots + y_n\eta_n, \]

where the coefficients $y_1, y_2, \ldots, y_n$ are rational functions of z. Such a system of
functions $\eta_1, \eta_2, \ldots, \eta_n$ will be called a basis of the field $\Omega$.

A system of functions $\eta_1, \eta_2, \ldots, \eta_n$ in the field $\Omega$ is a basis if and only if they
satisfy no equation (identity) of the form

\[ y_1\eta_1 + y_2\eta_2 + \cdots + y_n\eta_n = 0 \]

in which the coefficients $y_1, y_2, \ldots, y_n$ do not all vanish. For example, the functions
$1, \theta, \theta^2, \ldots, \theta^{n-1}$ form a basis of $\Omega$.
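
As a concrete illustration that is not part of the original text, one may take n = 2 and the irreducible equation $\theta^2 - z = 0$. The sketch below uses hypothetical helper names and represents an element $a + b\theta$ with polynomial coefficients a, b as a pair of coefficient lists. It checks that products reduce to the form $x_0 + x_1\theta$ and that $(a + b\theta)(a - b\theta) = a^2 - zb^2$ is free of $\theta$, which is what makes division possible once rational functions of z are allowed as coefficients.

```python
# Polynomials in z are lists of ints, lowest degree first: [c0, c1, ...] = c0 + c1*z + ...

def trim(p):
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def padd(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def pmul(p, q):
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

Z = [0, 1]  # the polynomial z

def mul(u, v):
    """(a + b*theta)(c + d*theta), reduced with theta^2 = z to x0 + x1*theta."""
    (a, b), (c, d) = u, v
    return (padd(pmul(a, c), pmul(Z, pmul(b, d))), padd(pmul(a, d), pmul(b, c)))

def conj(u):
    """a + b*theta  ->  a - b*theta (assumed example field only)."""
    a, b = u
    return (a, [-c for c in b])

zeta = ([1], [1])                    # 1 + theta
square = mul(zeta, zeta)             # (1 + z) + 2*theta
product = mul(zeta, conj(zeta))      # 1 - z, with no theta term
```

Since the product with the conjugate is the polynomial 1 - z, the inverse of $1 + \theta$ is $(1 - \theta)/(1 - z)$, again of the form $x_0 + x_1\theta$ with rational coefficients.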


§2. Norm, trace, and discriminant 


SUMMARY AND COMMENTS 


Just as every algebraic number a in a field of finite degree has
a norm N(a), which is a rational number, so does each algebraic
function $\zeta$ in a field $\Omega$ of degree n have a norm $N(\zeta)$, which is
a rational function. The basic properties of the algebraic function
norm are like those of the algebraic number norm: the norm of a
rational function is its nth power, and the norm is multiplicative:
$N(\zeta\zeta') = N(\zeta)N(\zeta')$. As with algebraic numbers, this often allows
divisibility questions for algebraic objects to be reduced to divisibility
questions for rational objects.

But in this section, the norm comes from a determinant (the
characteristic polynomial of multiplication by $\zeta$ on $\Omega$) rather than
as a product of conjugates, as in the algebraic number case. The
product of conjugates is somewhat more subtle in the function case,
and it is taken up much later, in §15.

The present approach has the advantage of revealing another
useful concept: the trace $\mathrm{Tr}(\zeta)$. $N(\zeta)$ and $\mathrm{Tr}(\zeta)$ are, up to sign,
the coefficients of $\zeta^0$ and $\zeta^{n-1}$ in the characteristic polynomial
equation satisfied by $\zeta$. The basic property of the trace is that it is
additive:

\[ \mathrm{Tr}(\zeta + \zeta') = \mathrm{Tr}(\zeta) + \mathrm{Tr}(\zeta'). \]


¹¹ The curious shorthand $\sum \pm a_{11}a_{22}\cdots a_{nn}$ for the determinant with (i, j)-entry $a_{ij}$ is often
used by Dedekind and Weber in this paper. (Translator’s note.)

The third important concept in this section, the discriminant, is
likewise motivated by the concept of the same name in Dedekind’s
(1877) algebraic number theory. The discriminant $\Delta(\eta_1, \eta_2, \ldots, \eta_n)$
of functions $\eta_1, \eta_2, \ldots, \eta_n \in \Omega$ is a determinant that tests whether
$\eta_1, \eta_2, \ldots, \eta_n$ form a basis of $\Omega$, being nonzero if and only if they
do. The vanishing of $\Delta(\eta_1, \eta_2, \ldots, \eta_n)$ when $\eta_1, \eta_2, \ldots, \eta_n$ do not
form a basis of $\Omega$ also follows from the fundamental theorem on
discriminants, namely

\[ \Delta(\eta'_1, \eta'_2, \ldots, \eta'_n) = X^2\,\Delta(\eta_1, \eta_2, \ldots, \eta_n), \]

where X is the determinant of the linear equations expressing the
functions $\eta'_1, \eta'_2, \ldots, \eta'_n$ in terms of the basis elements $\eta_1, \eta_2, \ldots, \eta_n$.


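Given an arbitrary basis $\eta_1, \eta_2, \ldots, \eta_n$ for the representation of functions in $\Omega$,
and any function $\zeta$, one can set

\[
\begin{aligned}
\zeta\eta_1 &= y_{1,1}\eta_1 + y_{1,2}\eta_2 + \cdots + y_{1,n}\eta_n,\\
\zeta\eta_2 &= y_{2,1}\eta_1 + y_{2,2}\eta_2 + \cdots + y_{2,n}\eta_n,\\
&\;\;\vdots\\
\zeta\eta_n &= y_{n,1}\eta_1 + y_{n,2}\eta_2 + \cdots + y_{n,n}\eta_n,
\end{aligned}
\tag{1}
\]

where the coefficients $y_{\iota,\iota'}$ are rational functions of z. This yields the equation

\[
\begin{vmatrix}
y_{1,1}-\zeta & y_{1,2} & \cdots & y_{1,n}\\
y_{2,1} & y_{2,2}-\zeta & \cdots & y_{2,n}\\
\vdots & \vdots & \ddots & \vdots\\
y_{n,1} & y_{n,2} & \cdots & y_{n,n}-\zeta
\end{vmatrix} = 0,
\tag{2}
\]

which, when arranged according to powers of $\zeta$, has the form

\[ \varphi(\zeta) = \zeta^n + b_1\zeta^{n-1} + \cdots + b_{n-1}\zeta + b_n = 0. \tag{3} \]

And it follows easily from the multiplication theorem for determinants¹² that this
equation is independent of the choice of basis $\eta_1, \eta_2, \ldots, \eta_n$. Among the coefficients
$b_1, b_2, \ldots, b_n$ of the function $\varphi$, which are all rational functions of z and completely
determined by $\zeta$, there are two that are given special names, due to their importance
in what follows. The function

\[
(-1)^n b_n =
\begin{vmatrix}
y_{1,1} & y_{1,2} & \cdots & y_{1,n}\\
y_{2,1} & y_{2,2} & \cdots & y_{2,n}\\
\vdots & \vdots & \ddots & \vdots\\
y_{n,1} & y_{n,2} & \cdots & y_{n,n}
\end{vmatrix}
\tag{4}
\]

is called the norm of the function $\zeta$ and is denoted by $N(\zeta)$. This function is subject
to the following theorems.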


1. The only function whose norm vanishes identically is the “zero” function.
Because if $N(\zeta) = 0$ in system (1) it follows that there is a system $y_1, y_2, \ldots, y_n$ of
rational functions of z, not all vanishing, such that

\[ \zeta(y_1\eta_1 + y_2\eta_2 + \cdots + y_n\eta_n) = 0. \]

Then, since $\eta_1, \eta_2, \ldots, \eta_n$ is a basis of $\Omega$, $\zeta = 0$.


¹² That is, the theorem that the determinant is multiplicative. (Translator’s note.)

2. The norm of a rational function of z is the nth power of the function.
Because if $\zeta$ is rational the equations (1) reduce to the identities $\zeta\eta_\iota = \zeta\eta_\iota$, whence
$N(\zeta) = \zeta^n$.

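3. If $\zeta'$ is a second function in the field $\Omega$ and if the equation system for this
function, corresponding to (1), is

\[ \zeta'\eta_\iota = \sum_{\iota'} y'_{\iota,\iota'}\,\eta_{\iota'}, \]

then it follows that

\[ \zeta\zeta'\eta_\iota = \sum_{\iota',\iota''} y'_{\iota,\iota'}\,y_{\iota',\iota''}\,\eta_{\iota''}, \]

whence

\[ N(\zeta\zeta') = N(\zeta)\,N(\zeta') \]

by the multiplication theorem for determinants.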


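4. It follows from 2 and 3 that

\[ N(\zeta)\,N\!\left(\frac{1}{\zeta}\right) = 1, \]

hence

\[ N\!\left(\frac{\zeta}{\zeta'}\right) = \frac{N(\zeta)}{N(\zeta')}. \]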


5. Finally, the definition of the function $\varphi$, together with (2) and (3), yields
the important theorem: if t is an arbitrary constant (or a rational function of z)
then

\[ \varphi(t) = N(t - \zeta). \]
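
The determinant definition of the norm can be tried out on a small case. The sketch below is an illustration only, not the authors' computation: it assumes the example field with $\theta^2 = z$, the basis $(1, \theta)$, and hypothetical helper names, and it restricts to elements $a + b\theta$ with polynomial coefficients. The matrix of multiplication by $a + b\theta$ is then read off from $\zeta \cdot 1 = a + b\theta$ and $\zeta \cdot \theta = zb + a\theta$.

```python
# Polynomials in z are lists of ints, lowest degree first.

def trim(p):
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def padd(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def psub(p, q):
    return padd(p, [-c for c in q])

def pmul(p, q):
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

Z = [0, 1]  # the polynomial z

def mul(u, v):
    """(a + b*theta)(c + d*theta) with theta^2 = z."""
    (a, b), (c, d) = u, v
    return (padd(pmul(a, c), pmul(Z, pmul(b, d))), padd(pmul(a, d), pmul(b, c)))

def mult_matrix(u):
    """Coefficients y_{i,j} of system (1) for u = a + b*theta in the basis (1, theta)."""
    a, b = u
    return [[a, b], [pmul(Z, b), a]]

def norm(u):
    """Determinant of the multiplication matrix, as in (4)."""
    (y11, y12), (y21, y22) = mult_matrix(u)
    return psub(pmul(y11, y22), pmul(y12, y21))

u = ([1, 1], [1])     # (1 + z) + theta
v = ([0, 1], [-2])    # z - 2*theta
# Theorem 3: the norm is multiplicative.
assert norm(mul(u, v)) == pmul(norm(u), norm(v))
# Theorem 2: the norm of the rational function z is z^2 (here n = 2).
assert norm((Z, [])) == pmul(Z, Z)
```

In this example the norm of $a + b\theta$ works out to $a^2 - zb^2$, so it vanishes only for the zero element, in line with theorem 1.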
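We now call the function

\[ -b_1 = y_{1,1} + y_{2,2} + \cdots + y_{n,n} \tag{5} \]

the trace of $\zeta$ and denote it by $\mathrm{Tr}(\zeta)$.¹³ The following theorems about it follow
immediately from the definition:

\[ \mathrm{Tr}(0) = 0, \tag{6} \]
\[ \mathrm{Tr}(1) = n. \tag{7} \]

Also, when x is a rational function of z and $\zeta, \zeta'$ are two functions in $\Omega$:

\[ \mathrm{Tr}(x\zeta) = x\,\mathrm{Tr}(\zeta), \tag{8} \]
\[ \mathrm{Tr}(\zeta + \zeta') = \mathrm{Tr}(\zeta) + \mathrm{Tr}(\zeta'). \tag{9} \]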
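
The relation between trace, norm, and the equation (3) can be checked in the same small example. This is a hedged sketch under the assumption $\theta^2 = z$ with basis $(1, \theta)$ and polynomial coefficients; all function names are hypothetical. For $\zeta = a + b\theta$ the diagonal entries of the multiplication matrix are both a, so the trace is 2a, and for n = 2 equation (3) reads $\varphi(t) = t^2 - \mathrm{Tr}(\zeta)\,t + N(\zeta)$.

```python
# Polynomials in z are lists of ints, lowest degree first.

def trim(p):
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def padd(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def psub(p, q):
    return padd(p, [-c for c in q])

def pmul(p, q):
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

Z = [0, 1]  # the polynomial z

def mul(u, v):
    (a, b), (c, d) = u, v
    return (padd(pmul(a, c), pmul(Z, pmul(b, d))), padd(pmul(a, d), pmul(b, c)))

def add(u, v):
    return (padd(u[0], v[0]), padd(u[1], v[1]))

def scale(p, u):
    """Multiply the element u by the polynomial p in z."""
    return (pmul(p, u[0]), pmul(p, u[1]))

def trace(u):
    return padd(u[0], u[0])            # y_{1,1} + y_{2,2} = a + a

def norm(u):
    a, b = u
    return psub(pmul(a, a), pmul(Z, pmul(b, b)))

def phi_at(u, t):
    """phi(t) = t^2 - Tr(u)*t + N(u), evaluated at a polynomial t in z."""
    return padd(psub(pmul(t, t), pmul(trace(u), t)), norm(u))

u, v = ([1, 1], [1]), ([0, 1], [-2])
# (8) and (9): the trace is linear over polynomials in z.
assert trace(add(u, v)) == padd(trace(u), trace(v))
assert trace(scale(Z, u)) == pmul(Z, trace(u))
# phi(zeta) = 0: zeta^2 - Tr(zeta)*zeta + N(zeta) is the zero element.
minus_tr_u = scale([-1], scale(trace(u), u))
assert add(add(mul(u, u), minus_tr_u), (norm(u), [])) == ([], [])
# Theorem 5: phi(t) = N(t - zeta), tested at t = z^2.
t = [0, 0, 1]
assert phi_at(u, t) == norm((psub(t, u[0]), [-c for c in u[1]]))
```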


It follows from these considerations that every function $\zeta$ in $\Omega$ satisfies an nth
degree equation $\varphi(\zeta) = 0$ whose coefficients are rational in z. When this equation
is irreducible, the functions $1, \zeta, \zeta^2, \ldots, \zeta^{n-1}$ form a basis of $\Omega$. If not, let

\[ \varphi_1(\zeta) = \zeta^e + b'_1\zeta^{e-1} + \cdots + b'_{e-1}\zeta + b'_e = 0 \tag{10} \]

be the equation of lowest degree, with rational coefficients in z, satisfied by $\zeta$.
Thus $\varphi_1(\zeta) = 0$ is irreducible and e < n. Since $\varphi(\zeta)$ also vanishes, $\varphi(\zeta)$ must be
algebraically divisible by $\varphi_1(\zeta)$, and it follows as in §1 that each rational function
$\eta$ of z and $\zeta$ is representable in the form

\[ \eta = x_0 + x_1\zeta + \cdots + x_{e-1}\zeta^{e-1}, \]

where the coefficients $x_0, x_1, \ldots, x_{e-1}$ are rational in z. Now if

\[ \theta^f + \eta_1\theta^{f-1} + \cdots + \eta_{f-1}\theta + \eta_f = 0 \]

is the equation of lowest degree satisfied by $\theta$ with coefficients rational in z and $\zeta$,
then the $e \cdot f$ functions

\[ \zeta^h\theta^k \quad (h = 0, 1, \ldots, e-1;\ k = 0, 1, \ldots, f-1) \tag{11} \]

satisfy no linear equation with coefficients rational in z, while each function in $\Omega$ is
representable as a linear combination of these functions with coefficients rational
in z. It follows that the functions (11) form a basis of $\Omega$, so that

\[ e \cdot f = n \]

and e is a divisor of n.

If one uses the basis (11) to construct the norm of $\zeta$, then one sees easily from
equation (10) that

\[ N(\zeta) = \left((-1)^e b'_e\right)^f = (-1)^n {b'_e}^{f}. \]

Since, in addition, the function $\zeta - t$ for any constant t satisfies an equation of the
same degree as $\zeta$ does, we have the theorem:

6.¹⁵ The function $\varphi(t)$ in (3) is either irreducible or an integer power of an
irreducible function.

¹³ Dedekind denotes it by $S(\zeta)$ because the German word is “Spur.” (Translator’s note.)

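If $\eta_1, \eta_2, \ldots, \eta_n$ is any system of n functions in $\Omega$, whether a basis or not,
then we associate with this system a rational function of z called the discriminant,
denoted by $\Delta(\eta_1, \eta_2, \ldots, \eta_n)$ and defined as follows:

\[
\Delta(\eta_1, \eta_2, \ldots, \eta_n) =
\begin{vmatrix}
\mathrm{Tr}(\eta_1\eta_1) & \mathrm{Tr}(\eta_1\eta_2) & \cdots & \mathrm{Tr}(\eta_1\eta_n)\\
\mathrm{Tr}(\eta_2\eta_1) & \mathrm{Tr}(\eta_2\eta_2) & \cdots & \mathrm{Tr}(\eta_2\eta_n)\\
\vdots & \vdots & \ddots & \vdots\\
\mathrm{Tr}(\eta_n\eta_1) & \mathrm{Tr}(\eta_n\eta_2) & \cdots & \mathrm{Tr}(\eta_n\eta_n)
\end{vmatrix}
\tag{12}
\]

The discriminant is identically zero if and only if the functions $\eta_1, \eta_2, \ldots, \eta_n$ do not form
a basis of $\Omega$.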
To prove the first part of this assertion, we suppose that $\Delta(\eta_1, \eta_2, \ldots, \eta_n) = 0$.
Under this hypothesis there is a system of rational functions $y_1, y_2, \ldots, y_n$ of z,
which do not all vanish, such that

\[
y_1\mathrm{Tr}(\eta_1\eta_\iota) + y_2\mathrm{Tr}(\eta_2\eta_\iota) + \cdots + y_n\mathrm{Tr}(\eta_n\eta_\iota)
= \mathrm{Tr}\bigl(\eta_\iota(y_1\eta_1 + y_2\eta_2 + \cdots + y_n\eta_n)\bigr) = 0
\quad (\iota = 1, 2, \ldots, n).
\]

Hence if one chooses an arbitrary system $x_1, x_2, \ldots, x_n$ of rational functions of z
and sets

\[
\begin{aligned}
y_1\eta_1 + y_2\eta_2 + \cdots + y_n\eta_n &= \eta,\\
x_1\eta_1 + x_2\eta_2 + \cdots + x_n\eta_n &= \xi,
\end{aligned}
\]

then it follows that

\[ \mathrm{Tr}(\xi\eta) = 0. \]

But when the functions $\eta_1, \eta_2, \ldots, \eta_n$ form a basis of $\Omega$ the function $\xi$ can be
arbitrary in $\Omega$, and since $\eta$ does not vanish, it can for example be $1/\eta$. The last equation
is then certainly not satisfied, and hence under this hypothesis the discriminant of
$\eta_1, \eta_2, \ldots, \eta_n$ does not vanish identically.

¹⁴ The equation $\varphi_1(\zeta) = 0$ gives rise to a field $\Omega_1$ of degree e, of algebraic functions that also
belong to $\Omega$, and hence it may be called a divisor of the field $\Omega$. (Translator’s note: This use of the
word “divisor” to denote a substructure is at odds with the arithmetical definition, as Dedekind
and Weber concede when they define divisibility of modules in §4, 7.)

¹⁵ This subsection is wrongly labelled 10 in the original. However, no confusion arises, because
it is never cross referenced. (Translator’s note.)
If we now assume that $\eta_1, \eta_2, \ldots, \eta_n$ form a basis of $\Omega$ and set

\[ \eta'_k = x_{1,k}\eta_1 + x_{2,k}\eta_2 + \cdots + x_{n,k}\eta_n \quad (k = 1, 2, \ldots, n) \]

then the functions $\eta'_1, \eta'_2, \ldots, \eta'_n$ form a basis of $\Omega$, or not, according as the
determinant

\[ X = \sum \pm x_{1,1}x_{2,2}\cdots x_{n,n} \]

of the rational functions $x_{\iota,k}$ of z is nonzero or not. But

\[ \mathrm{Tr}(\eta'_k\eta'_{k'}) = \sum_{1 \le \iota,\iota' \le n} x_{\iota,k}\,x_{\iota',k'}\,\mathrm{Tr}(\eta_\iota\eta_{\iota'}), \]

and this, together with the multiplication theorem for determinants, yields the
fundamental theorem on discriminants

\[ \Delta(\eta'_1, \eta'_2, \ldots, \eta'_n) = X^2\,\Delta(\eta_1, \eta_2, \ldots, \eta_n), \tag{13} \]

which also shows the correctness of the second part of the assertion above, namely,
that the discriminant of a system of functions vanishes when they do not form a
basis for $\Omega$.
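
The behaviour of the discriminant can be observed in the smallest nontrivial case. The following sketch is an illustration under stated assumptions, not a computation from the original: the field is again taken to be the one with $\theta^2 = z$, elements $a + b\theta$ have polynomial coefficients, and the helper names are hypothetical. It computes the 2 × 2 determinant of traces for the basis $(1, \theta)$, for the dependent system $(1, z)$, and for the system $(1, z\theta)$, whose transition determinant from $(1, \theta)$ is X = z.

```python
# Polynomials in z are lists of ints, lowest degree first.

def trim(p):
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def padd(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def psub(p, q):
    return padd(p, [-c for c in q])

def pmul(p, q):
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

Z = [0, 1]  # the polynomial z

def mul(u, v):
    (a, b), (c, d) = u, v
    return (padd(pmul(a, c), pmul(Z, pmul(b, d))), padd(pmul(a, d), pmul(b, c)))

def trace(u):
    return padd(u[0], u[0])   # Tr(a + b*theta) = 2a in the basis (1, theta)

def disc(e1, e2):
    """Determinant of the matrix (Tr(e_i e_j)), as in (12) with n = 2."""
    t11, t12, t22 = trace(mul(e1, e1)), trace(mul(e1, e2)), trace(mul(e2, e2))
    return psub(pmul(t11, t22), pmul(t12, t12))

one, theta = ([1], []), ([], [1])
d_basis = disc(one, theta)              # 4z, nonzero: (1, theta) is a basis
d_dependent = disc(one, (Z, []))        # zero polynomial: 1 and z are dependent
d_scaled = disc(one, ([], [0, 1]))      # system (1, z*theta), with X = z
# Fundamental theorem (13): the new discriminant is X^2 times the old one.
assert d_scaled == pmul(pmul(Z, Z), d_basis)
```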


§3. The system of integral algebraic functions of z in the field $\Omega$


SUMMARY AND COMMENTS 


Having established a suitable framework for algebraic function
theory (a finite-degree extension $\Omega$ of the field of rational
functions), Dedekind and Weber now identify the “integers” of $\Omega$:
the integral algebraic functions. These functions form a ring and
have properties analogous to those of the algebraic integers studied
by Dedekind (1877). For example, just as the norm of an algebraic
integer is an ordinary integer, the norm of an integral algebraic
function is an “integral rational function,” that is, a polynomial.

One finds throughout that polynomials appear in the theory of
algebraic functions where ordinary integers appear in the theory of
algebraic integers.

The appropriate concept of basis for integral algebraic functions,
naturally, is an independent collection of such functions, the
linear combinations of which (with polynomial coefficients) are all
the integral algebraic functions in $\Omega$. It is shown that such “integral
bases” exist, and that they all have the same discriminant.
This invariant discriminant is called the discriminant of $\Omega$.


Definition. A function $\omega$ in the field $\Omega$ is called an integral algebraic function
of z when, in the equation of lowest degree satisfied by $\omega$ according to §2,

\[ \varphi(\omega) = \omega^e + b_1\omega^{e-1} + \cdots + b_{e-1}\omega + b_e = 0, \tag{1} \]

the coefficients $b_1, b_2, \ldots, b_e$ are polynomial functions of z. In the contrary case it
is called a fractional function. The collection of all integral algebraic functions of
z in $\Omega$ will be denoted by $\mathfrak{o}$. Since, by §2, $N(t - \omega)$ is an integer power of $\varphi(t)$, it
follows that if $\omega$ is an integral algebraic function then all the coefficients of $N(t - \omega)$
are polynomial functions of z, hence in particular:


1. The norm and trace of integral algebraic functions are polynomial functions 
of z. 


In addition, the definition of integral algebraic function yields: 


2. A rational function of z belongs to the system $\mathfrak{o}$ if and only if it is a
polynomial function of z.


3. Each function $\eta$ in $\Omega$ can be converted into a function in $\mathfrak{o}$ by multiplication
by a nonzero polynomial function of z. By §2, $\eta$ satisfies an equation of minimal
degree of the form

\[ b_0\eta^e + b_1\eta^{e-1} + \cdots + b_{e-1}\eta + b_e = 0, \]

whose coefficients are polynomial functions of z, and this becomes an equation of
the form (1) for $\omega$ under the substitution $b_0\eta = \omega$.


4. A function $\omega$ in the field $\Omega$ that satisfies any equation of the form

\[ \psi(\omega) = \omega^m + c_1\omega^{m-1} + \cdots + c_{m-1}\omega + c_m = 0, \]

in which the coefficients $c_1, \ldots, c_m$ are polynomial functions of z, is an integral
algebraic function. Because if

\[ \varphi(\omega) = \omega^e + b_1\omega^{e-1} + \cdots + b_{e-1}\omega + b_e = 0 \]

is the equation of lowest degree satisfied by $\omega$ then $\psi(\omega)$ must be algebraically
divisible by $\varphi(\omega)$:

\[ \psi(\omega) = \varphi(\omega)\chi(\omega), \]

and it is easily seen that this implies that the coefficients of $\varphi(\omega)$ and $\chi(\omega)$ are also
polynomial functions of z (Gauss, Disquisitiones Arithmeticae, art. 42).¹⁶ This
yields the fundamental theorem on integral algebraic functions:


5. The sum, difference and product of two integral algebraic functions are also 
integral algebraic functions. 


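¹⁶ This is a reference to the famous Gauss’s lemma, which is actually about polynomials with 
rational number coefficients, but a similar lemma holds for polynomials with rational function 
coefficients. (Translator’s note.) 

Namely, if $\omega'$, $\omega''$ are two integral algebraic functions in $\Omega$ that satisfy the 
respective equations 

\[ {\omega'}^{n'} + b'_1{\omega'}^{n'-1} + \cdots + b'_{n'-1}\omega' + b'_{n'} = 0, \]
\[ {\omega''}^{n''} + b''_1{\omega''}^{n''-1} + \cdots + b''_{n''-1}\omega'' + b''_{n''} = 0, \]

then one can, by letting $\omega_1, \omega_2, \ldots, \omega_m$ be the $m = n'n''$ products 

\[ {\omega'}^{h'}{\omega''}^{h''}, \qquad h' = 0, 1, \ldots, n'-1; \quad h'' = 0, 1, \ldots, n''-1, \]

and letting $\omega$ denote one of the three functions $\omega' \pm \omega''$, $\omega'\omega''$, set:¹⁷ 

\[
\begin{aligned}
\omega\omega_1 &= x_{1,1}\omega_1 + \cdots + x_{1,m}\omega_m, \\
&\;\;\vdots \\
\omega\omega_m &= x_{m,1}\omega_1 + \cdots + x_{m,m}\omega_m,
\end{aligned}
\]

where the $x_{h,k}$ are polynomial functions of $z$. And from this one obtains 

\[
\begin{vmatrix}
x_{1,1} - \omega & x_{1,2} & \cdots & x_{1,m} \\
x_{2,1} & x_{2,2} - \omega & \cdots & x_{2,m} \\
\vdots & \vdots & & \vdots \\
x_{m,1} & x_{m,2} & \cdots & x_{m,m} - \omega
\end{vmatrix} = 0,
\]

which is an equation for $\omega$ whose coefficients are polynomial functions of $z$. 
This yields the corollary that each polynomial function of functions in $\mathfrak{o}$ is itself 
in $\mathfrak{o}$. 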
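As a small worked instance of this determinant argument (an illustration that is not part of the original text), assume that $\Omega$ contains $\omega' = \sqrt{z}$ and $\omega'' = \sqrt{z+1}$, so that $n' = n'' = 2$ and $m = 4$. Taking the products in the order $\omega_1 = 1$, $\omega_2 = \omega'$, $\omega_3 = \omega''$, $\omega_4 = \omega'\omega''$ and choosing $\omega = \omega' + \omega''$, the system and its determinant become:

```latex
% Assumed example: \omega'^2 = z, \omega''^2 = z + 1, \omega = \omega' + \omega''.
\begin{aligned}
\omega\,\omega_1 &= \omega_2 + \omega_3, &
\omega\,\omega_2 &= z\,\omega_1 + \omega_4, \\
\omega\,\omega_3 &= (z+1)\,\omega_1 + \omega_4, &
\omega\,\omega_4 &= (z+1)\,\omega_2 + z\,\omega_3,
\end{aligned}
\qquad
\begin{vmatrix}
-\omega & 1 & 1 & 0 \\
z & -\omega & 0 & 1 \\
z+1 & 0 & -\omega & 1 \\
0 & z+1 & z & -\omega
\end{vmatrix}
= \omega^4 - 2(2z+1)\,\omega^2 + 1 = 0.
```

The equation obtained has leading coefficient 1 and polynomial coefficients, so by 4 the sum $\sqrt{z} + \sqrt{z+1}$ is an integral algebraic function.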


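6. An integral algebraic function $\omega$ is said to be divisible by another integral 
algebraic function $\omega'$ when there is a third integral algebraic function $\omega''$ satisfying 
the condition 

\[ \omega = \omega'\omega''. \]

It follows immediately from this definition that: 
If $\omega$ is divisible by $\omega'$, and $\omega'$ by $\omega''$, then $\omega$ is also divisible by $\omega''$. 
If $\omega'$ and $\omega''$ are divisible by $\omega$ then $\omega' \pm \omega''$ is also divisible by $\omega$, and in general: 
if $\omega_1, \omega_2, \omega_3, \ldots$ are divisible by $\omega$ and if $\omega'_1, \omega'_2, \omega'_3, \ldots$ are any functions in $\mathfrak{o}$, then 
$\omega'_1\omega_1 + \omega'_2\omega_2 + \omega'_3\omega_3 + \cdots$ is also divisible by $\omega$. 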
7. If the functions $\eta_1, \eta_2, \ldots, \eta_n$ form a basis of $\Omega$, then (by 3) one can determine 
$n$ nonzero polynomial functions $a_1, a_2, \ldots, a_n$ of $z$ so that 

\[ \omega_1 = a_1\eta_1, \quad \omega_2 = a_2\eta_2, \quad \ldots, \quad \omega_n = a_n\eta_n \]

are integral algebraic functions, and the latter likewise form a basis of $\Omega$, since 

\[ \Delta(\omega_1, \omega_2, \ldots, \omega_n) = a_1^2 a_2^2 \cdots a_n^2\, \Delta(\eta_1, \eta_2, \ldots, \eta_n) \]

is nonzero. Thus $\Omega$ has bases $\omega_1, \omega_2, \ldots, \omega_n$ consisting wholly of integral algebraic 
functions, and the discriminant of such a basis is itself a nonzero polynomial function 
of $z$, since the $\operatorname{Tr}(\omega_\iota\omega_\kappa)$ are polynomial functions of $z$. Each function of the 
form 


(2) $\omega = x_1\omega_1 + x_2\omega_2 + \cdots + x_n\omega_n$, 


¹⁷ This is essentially the same as the argument by which Dedekind (1871), p. 437, proves that 
the algebraic integers are closed under sum, difference and product. (Translator’s note.) 

in which the $x_1, x_2, \ldots, x_n$ are polynomial functions of $z$, then belongs to the system 
$\mathfrak{o}$. However, it certainly does not follow that, conversely, each function in $\mathfrak{o}$ is 
representable in this form. 
If we suppose that $\mathfrak{o}$ includes functions not of the form (2) then it is possible 
to choose a linear function $z - c$ and certain polynomial functions $x_1, x_2, \ldots, x_n$, 
not all divisible by $z - c$, so that 

\[ \frac{x_1\omega_1 + x_2\omega_2 + \cdots + x_n\omega_n}{z - c} \]

is an integral algebraic function. The functions $x_1, x_2, \ldots, x_n$ may now be reduced 
to their constant, and not all zero, remainders $c_1, c_2, \ldots, c_n$ modulo $z - c$, and one 
sees that 

\[ \omega = \frac{c_1\omega_1 + c_2\omega_2 + \cdots + c_n\omega_n}{z - c} \]

is also an integral algebraic function. If $c_1$ is nonzero, then the $n$ integral algebraic 
functions 

\[ \omega \quad\text{and}\quad \omega_2, \omega_3, \ldots, \omega_n \]

also form a basis of $\Omega$ and at the same time, by §2 (13), 

\[ \Delta(\omega, \omega_2, \ldots, \omega_n) = \frac{c_1^2}{(z - c)^2}\, \Delta(\omega_1, \omega_2, \ldots, \omega_n) \]

is of lower degree than $\Delta(\omega_1, \omega_2, \ldots, \omega_n)$. Now, since both these discriminants are 
polynomial functions of $z$, by repeating this process one finally arrives at a basis 
of $\Omega$ consisting of integral algebraic functions $\omega'_1, \omega'_2, \ldots, \omega'_n$ with discriminant of 
a degree that cannot be lowered, and consequently each function $\omega$ in $\mathfrak{o}$ is of the 
form 

\[ \omega = x_1\omega'_1 + x_2\omega'_2 + \cdots + x_n\omega'_n, \]

with polynomial functions of $z$ as coefficients. Such a system will be called a basis 
of $\mathfrak{o}$. 
If $\omega_1, \omega_2, \ldots, \omega_n$ is a basis of $\mathfrak{o}$ and if 

\[ \omega'_\iota = x_{\iota,1}\omega_1 + x_{\iota,2}\omega_2 + \cdots + x_{\iota,n}\omega_n, \qquad (\iota = 1, 2, \ldots, n) \]

then the system $\omega'_1, \omega'_2, \ldots, \omega'_n$ is a basis of $\mathfrak{o}$ if and only if the determinant 

\[ X = \sum \pm\, x_{1,1}x_{2,2}\cdots x_{n,n} \]

of the polynomial functions $x_{\iota,\kappa}$ is a nonzero constant. Because if this determinant 
has any linear factor $z - c$ then it is possible to determine constants $c_1, c_2, \ldots, c_n$, 
not all zero, so that the $n$ polynomials in $z$ 

\[ c_1x_{1,\iota} + c_2x_{2,\iota} + \cdots + c_nx_{n,\iota} \qquad (\iota = 1, 2, \ldots, n) \]

are divisible by $z - c$ (i.e., they vanish for $z = c$). But then 

\[ \frac{c_1\omega'_1 + c_2\omega'_2 + \cdots + c_n\omega'_n}{z - c} \]

is an integral algebraic function and hence $\omega'_1, \omega'_2, \ldots, \omega'_n$ is not a basis of $\mathfrak{o}$. 
On the other hand, since 

\[ \Delta(\omega'_1, \omega'_2, \ldots, \omega'_n) = X^2\, \Delta(\omega_1, \omega_2, \ldots, \omega_n), \]

it follows that the discriminant of a basis of $\mathfrak{o}$ is independent of the choice of basis, 
apart from a constant factor. Thus one obtains a definite polynomial in $z$ when one 
divides the discriminant of any basis of $\mathfrak{o}$ by the coefficient of the highest power of 
$z$. The latter function will be called the discriminant of the field $\Omega$ or of the system 
$\mathfrak{o}$, and it will be denoted by $\Delta(\Omega)$ or $\Delta(\mathfrak{o})$. 
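The degree-lowering process can be followed on a small example, which is an illustration and not taken from the original. Assume $\Omega = \mathbb{C}(z, y)$ with $y^2 = z^2(z-1)$, so $n = 2$, and assume (as the reference to the traces above suggests) that the discriminant of §2 is the determinant of the traces $\operatorname{Tr}(\omega_\iota\omega_\kappa)$. Starting from the integral basis $1, y$ of $\Omega$ and taking $c = 0$:

```latex
% Assumed example: \Omega = \mathbb{C}(z, y) with y^2 = z^2 (z - 1), n = 2.
\begin{aligned}
\Delta(1, y) &=
\begin{vmatrix} \operatorname{Tr}(1) & \operatorname{Tr}(y) \\ \operatorname{Tr}(y) & \operatorname{Tr}(y^2) \end{vmatrix}
= \begin{vmatrix} 2 & 0 \\ 0 & 2z^2(z-1) \end{vmatrix} = 4z^2(z-1), \\
\omega &= \frac{0\cdot 1 + 1\cdot y}{z - 0}, \qquad \omega^2 = z - 1
\quad\text{(so $\omega$ is integral)}, \\
\Delta(1, \omega) &= \frac{1}{z^2}\,\Delta(1, y) = 4(z-1), \qquad \Delta(\Omega) = z - 1.
\end{aligned}
```

Here the nonzero remainder is $c_2 = 1$, so it is $y$ rather than $1$ that is replaced by $\omega$. Since $4(z-1)$ has no repeated linear factor, a further division by some $(z-c)^2$ is impossible, and under these assumptions $1, \omega$ is a basis of $\mathfrak{o}$.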


§4. Modules of functions 


SUMMARY AND COMMENTS 


To pave the way for the study of ideals of integral algebraic 
functions, Dedekind and Weber now introduce the concept of a 
module: a set of algebraic functions closed under the operations 
of addition, subtraction, and multiplication by polynomials. The 
analogous concept of module in algebraic number theory, studied 
in §1 of Dedekind (1877), requires only closure under addition and 
subtraction (but of course these yield closure under multiplication 
by integers). The initial development of the module concept is the 
same in both settings. As usual, there is a concept of basis, and all 
bases of a module have the same size. 

What is special, and crucial, about modules is the concept of 
divisibility. It is motivated by the concept of divisibility for ideals: 
“to divide is to contain.” That is, a module $\mathfrak{b}$ divides a module $\mathfrak{a}$ if $\mathfrak{b}$ 
contains $\mathfrak{a}$. In conformity with this concept of divisibility, the least 
common multiple of two modules $\mathfrak{a}, \mathfrak{b}$ is simply their intersection 
$\mathfrak{a} \cap \mathfrak{b}$. Not so obviously, the greatest common divisor of $\mathfrak{a}, \mathfrak{b}$ is the 
set of sums $\alpha + \beta$, where $\alpha \in \mathfrak{a}$ and $\beta \in \mathfrak{b}$. 

The product $\mathfrak{ab}$ is defined to be the set of all finite sums $\sum \alpha\beta$, 
where $\alpha \in \mathfrak{a}$ and $\beta \in \mathfrak{b}$. As Dedekind and Weber point out, it is 
not generally the case that $\mathfrak{a}$ divides $\mathfrak{ab}$. However, this turns out to 
be the case for ideals (see §9, 4). 


In what follows we consider systems of functions that we call modules of functions 
or simply modules, and define as follows. A function system (in $\Omega$) is called 
a module when it is closed under addition, subtraction and multiplication by polynomial 
functions of $z$. 

If $\alpha_1, \alpha_2, \ldots, \alpha_m$ are $m$ given functions and $x_1, x_2, \ldots, x_m$ are arbitrary polynomial 
functions of $z$, then the collection of all functions of the form 

\[ \alpha = x_1\alpha_1 + x_2\alpha_2 + \cdots + x_m\alpha_m \]

is a module. Such a module is said to be finitely generated¹⁸ and is denoted by 

\[ \mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_m]. \]

The system of functions $\alpha_1, \alpha_2, \ldots, \alpha_m$ is called a basis of this module. 
We call a function system $\alpha_1, \alpha_2, \ldots, \alpha_m$ rationally irreducible, or call the functions 
$\alpha_1, \alpha_2, \ldots, \alpha_m$ rationally independent, when an equation of the form 

\[ x_1\alpha_1 + x_2\alpha_2 + \cdots + x_m\alpha_m = 0 \]


¹⁸ In the language typically used for finitely generated structures at that time, Dedekind and 
Weber call the module “finite,” rather than “finitely generated.” Also, their modules are always 
submodules, of the function field in this case, and they omit mention of the ring the module is 
over, in this case the polynomial ring $\mathbb{C}[z]$. (Translator’s note.) 




holds for rational $x$ only if $x_1 = 0$, $x_2 = 0$, $\ldots$, $x_m = 0$. A basis of the field $\Omega$ is 
therefore rationally irreducible, and there is no system of more than $n$ rationally 
independent functions in $\Omega$. 

We now prove the theorem: 


1. Each finitely generated module has a rationally independent basis. 
The proof is obtained immediately from the following lemma: 


If the polynomial functions $y_{1,1}, y_{2,1}, \ldots, y_{m,1}$ have no common divisor, then 
one may determine other polynomial functions $y_{1,2}, y_{2,2}, \ldots, y_{m,m}$ so that 

\[ \sum \pm\, y_{1,1}y_{2,2}\cdots y_{m,m} = 1. \]¹⁹ 

Now if the functions $\alpha_1, \alpha_2, \ldots, \alpha_m$ satisfy an equation 

\[ \sum_{\iota=1}^{m} y_{\iota,1}\alpha_\iota = 0, \]

in which the polynomials $y_{1,1}, \ldots, y_{m,1}$ can be assumed to have no common divisor, 
one sets 

\[ \sum_{\iota=1}^{m} y_{\iota,\kappa}\alpha_\iota = \beta_\kappa. \]

Then the module $[\alpha_1, \alpha_2, \ldots, \alpha_m]$ is identical with the module $[\beta_2, \beta_3, \ldots, \beta_m]$ whose 
basis contains one function less. If the functions $\beta_\kappa$ are still not rationally independent 
then one can reduce further in the same way, and finally arrive at an irreducible 
basis except in the case where the functions $\alpha_\iota$ all vanish (a case we exclude from 
the concept of module). From now on we shall always understand a basis to be an 
irreducible basis. 


2. Although the preceding process finds many different bases for the same 
module, the number of functions in the basis is always the same, since a system 
with more functions cannot be rationally irreducible. Then if $\alpha_1, \alpha_2, \ldots, \alpha_m$ and 


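¹⁹ The result is correct and well known for $m = 2$. Thus if we suppose it proved for $m - 1$ we 
can satisfy the equation 

\[
\begin{vmatrix}
y_{1,1} & y_{2,1} & \cdots & y_{m-1,1} \\
y_{1,3} & y_{2,3} & \cdots & y_{m-1,3} \\
\vdots & \vdots & & \vdots \\
y_{1,m} & y_{2,m} & \cdots & y_{m-1,m}
\end{vmatrix} = \delta,
\]

where $\delta$ denotes the greatest common divisor of $y_{1,1}, y_{2,1}, \ldots, y_{m-1,1}$, and hence when we 
determine polynomials $x, y$ so that $x\,y_{m,1} - y\,\delta = (-1)^{m-1}$ it follows that 

\[
\begin{vmatrix}
y_{1,1} & y_{2,1} & \cdots & y_{m-1,1} & y_{m,1} \\
\dfrac{x\,y_{1,1}}{\delta} & \dfrac{x\,y_{2,1}}{\delta} & \cdots & \dfrac{x\,y_{m-1,1}}{\delta} & y \\
y_{1,3} & y_{2,3} & \cdots & y_{m-1,3} & 0 \\
\vdots & \vdots & & \vdots & \vdots \\
y_{1,m} & y_{2,m} & \cdots & y_{m-1,m} & 0
\end{vmatrix} = 1.
\]

$\beta_1, \beta_2, \ldots, \beta_m$ are two irreducible bases of the same module $\mathfrak{a}$, the fact that both 
generate $\mathfrak{a}$ means that 

\[ \alpha_\kappa = \sum_{\iota=1}^{m} p_\iota^{(\kappa)}\beta_\iota, \qquad \beta_\kappa = \sum_{\iota=1}^{m} q_\iota^{(\kappa)}\alpha_\iota, \]

where the coefficients $p, q$ are polynomial functions of $z$. But this implies 

\[ \sum_{\iota=1}^{m} p_\iota^{(h)} q_\kappa^{(\iota)} = 0 \text{ or } 1 \]

according as $h$ differs from $\kappa$ or not, whence 

\[ \sum \pm\, p_1^{(1)}p_2^{(2)}\cdots p_m^{(m)} \cdot \sum \pm\, q_1^{(1)}q_2^{(2)}\cdots q_m^{(m)} = 1 \]

and since both determinants are polynomial functions of $z$ they must both be 
constant. 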


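3. Definition. A module $\mathfrak{a}$ is said to be divisible by a module $\mathfrak{b}$, or $\mathfrak{b}$ is said 
to be a divisor of $\mathfrak{a}$, or $\mathfrak{a}$ a multiple of $\mathfrak{b}$ ($\mathfrak{b}$ goes into $\mathfrak{a}$), when each function in $\mathfrak{a}$ 
also belongs to $\mathfrak{b}$. We call $\mathfrak{b}$ a strict²⁰ divisor of $\mathfrak{a}$ when $\mathfrak{a}$ is divisible by $\mathfrak{b}$ but not 
identical to $\mathfrak{b}$.²¹ 

An immediate consequence of these definitions is: 

If $\mathfrak{a}$ is divisible by $\mathfrak{b}$, and $\mathfrak{b}$ is divisible by $\mathfrak{c}$, then $\mathfrak{a}$ is also divisible by $\mathfrak{c}$. 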


4. Definition. The collection $\mathfrak{m}$ of all the functions belonging to both modules 
$\mathfrak{a}$ and $\mathfrak{b}$, unless it consists only of the “zero” function, is a module (according to 
the general definition) called the least common multiple of $\mathfrak{a}$ and $\mathfrak{b}$, because any 
multiple of both $\mathfrak{a}$ and $\mathfrak{b}$ is also a multiple of $\mathfrak{m}$. Similarly, the least common 
multiple of arbitrarily many modules $\mathfrak{a}, \mathfrak{b}, \mathfrak{c}, \ldots$ is the collection of functions that 
belong to all of $\mathfrak{a}, \mathfrak{b}, \mathfrak{c}, \ldots$. One arrives at the same thing by successively replacing 
any two of the modules $\mathfrak{a}, \mathfrak{b}, \mathfrak{c}, \ldots$ by their least common multiple. 


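5. Definition. If $\alpha$ is an arbitrary function in $\mathfrak{a}$ and $\beta$ is an arbitrary function 
in $\mathfrak{b}$, then the collection of all functions of the form $\alpha + \beta$ forms a module $\mathfrak{d}$ called 
the greatest common divisor of the modules $\mathfrak{a}$ and $\mathfrak{b}$. When $\mathfrak{a}$ and $\mathfrak{b}$ are finitely 
generated modules, so is their greatest common divisor. Namely, if 

\[ \mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_r], \qquad \mathfrak{b} = [\beta_1, \beta_2, \ldots, \beta_s] \]

then 

\[ \mathfrak{d} = [\alpha_1, \alpha_2, \ldots, \alpha_r, \beta_1, \beta_2, \ldots, \beta_s]. \]

It follows from the definition of divisibility that $\mathfrak{d}$ is a divisor of both $\mathfrak{a}$ and $\mathfrak{b}$. 
Conversely, if $\mathfrak{d}'$ is a divisor of both $\mathfrak{a}$ and $\mathfrak{b}$ then the functions $\alpha$ and $\beta$, hence also 
the functions $\alpha + \beta$, belong to $\mathfrak{d}'$. Therefore $\mathfrak{d}$ is divisible by $\mathfrak{d}'$. 

The definition of the greatest common divisor of an arbitrary number of modules 
builds on this in the obvious way. 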
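The slogan “to divide is to contain” can be checked on the simplest number-theoretic analogue, which is an assumption made here for illustration: modules of the form $a\mathbb{Z}$ inside the integers, rather than modules of functions. The sketch below uses hypothetical helper names (`multiples`, `module_gcd_lcm`) and verifies, on a finite window of integers, that the intersection is generated by the least common multiple and the set of sums by the greatest common divisor.

```python
from math import gcd

def multiples(n, bound):
    """Elements of the module nZ (all integer multiples of n) inside [-bound, bound]."""
    return {k for k in range(-bound, bound + 1) if k % n == 0}

def module_gcd_lcm(a, b):
    """Generators of aZ + bZ (greatest common divisor) and of the
    intersection of aZ and bZ (least common multiple)."""
    d = gcd(a, b)
    return d, a * b // d

a, b, bound = 12, 18, 200
d, m = module_gcd_lcm(a, b)          # d = 6, m = 36
A, B = multiples(a, bound), multiples(b, bound)

# Least common multiple: the functions (here numbers) lying in both modules.
assert A & B == multiples(m, bound)

# Greatest common divisor: all sums alpha + beta, restricted to the window.
sums = {x + y for x in A for y in B if abs(x + y) <= bound}
assert sums == multiples(d, bound)

# "To divide is to contain": 6 divides 12, and 6Z contains 12Z.
assert multiples(12, bound) <= multiples(d, bound)
```

The finite window is only a device for making the sets comparable on a computer; the statements themselves concern the full infinite modules.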


²⁰ I am using “strict” as a translation of the word “echt” here, rather than “proper,” in order 
to use “proper/improper” as the translations of “eigentlich/uneigentlich,” which occur from §21 
onwards. (Translator’s note.) 

²¹ The concept of divisibility of modules is intuitively opposite to that of numbers, inasmuch 
as the divisor contains a greater quantity of functions than the multiple. (Translator’s note: 
However, the idea that “to divide is to contain” can be easily reconciled with the ordinary concept 
of divisibility, if one bears in mind that $b$ divides $a$ if and only if the multiples of $b$ include the 
multiples of $a$.) 

6. Definition. If $\mathfrak{a}$ is a module, $\alpha$ is an arbitrary function in $\mathfrak{a}$, and $\mu$ is an 
arbitrary function in $\Omega$, then the product $\mu\mathfrak{a}$ or $\mathfrak{a}\mu$ is the collection of all functions 
$\mu\alpha$, where $\alpha$ belongs to $\mathfrak{a}$. It is again a module. If 

\[ \mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_r] \]

is a finitely generated module, then 

\[ \mu\mathfrak{a} = [\mu\alpha_1, \mu\alpha_2, \ldots, \mu\alpha_r] \]

is likewise a finitely generated module, and $\mu\mathfrak{a} = \mu\mathfrak{b}$ implies $\mathfrak{a} = \mathfrak{b}$ when $\mu$ is 
nonzero. 


7. Definition. If $\mathfrak{a}, \mathfrak{b}$ are two modules, and $\alpha, \beta$ are any functions in $\mathfrak{a}, \mathfrak{b}$ 
respectively, then the product 

\[ \mathfrak{ab} = \mathfrak{ba} = \mathfrak{c} \]

is the collection of all products $\alpha\beta$ and all sums of such products, i.e., all the 
functions that can be represented by the expression 

\[ \gamma = \sum \alpha\beta. \]

This system of functions also constitutes a module, and indeed it is finitely 
generated when $\mathfrak{a}$ and $\mathfrak{b}$ are. If $\mathfrak{a}$ and $\mathfrak{b}$ are defined as in 5, then the $rs$ functions 
$\alpha_\iota\beta_\kappa$, if they also form an irreducible collection, are a basis of $\mathfrak{c}$. A product of 
arbitrarily many modules $\mathfrak{a}, \mathfrak{b}, \mathfrak{c}, \ldots$ is built up in the obvious way, and satisfies 
the fundamental theorem on exchangeability of factors in multiplication. If the 
individual terms in such a product are all equal to $\mathfrak{a}$, and are $m$ in number, then 
we denote it by $\mathfrak{a}^m$, and it has the property 

\[ \mathfrak{a}^{m+m'} = \mathfrak{a}^m\mathfrak{a}^{m'}. \]

In general the product $\mathfrak{ab}$ is not divisible by $\mathfrak{a}$. However, we have the following 
theorem, whose proof is immediate from the definition: 


If $\mathfrak{a}$ is divisible by $\mathfrak{a}_1$, and $\mathfrak{b}$ by $\mathfrak{b}_1$, then $\mathfrak{ab}$ is divisible by $\mathfrak{a}_1\mathfrak{b}_1$. 


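8. Definition. The quotient $\frac{\mathfrak{b}}{\mathfrak{a}}$ of two modules $\mathfrak{a}, \mathfrak{b}$ is the collection of all those 
functions $\gamma$ with the property that $\gamma\mathfrak{a}$ is divisible by $\mathfrak{b}$. This quotient, unless it 
consists of the single function “zero,” is itself a module $\mathfrak{c}$, as is immediately clear 
from the definition. The product $\frac{\mathfrak{b}}{\mathfrak{a}} \cdot \mathfrak{a}$ is always divisible by $\mathfrak{b}$, though not always 
equal to $\mathfrak{b}$. 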


§5. Congruences 


SUMMARY AND COMMENTS 


The reason for the centrality of the module concept (and the 
reason for its name) is that a module $\mathfrak{a}$ admits the relation of congruence 
modulo $\mathfrak{a}$. If we say that $\alpha \equiv \beta \pmod{\mathfrak{a}}$ if $\alpha - \beta \in \mathfrak{a}$, then 
the basic properties of congruence follow from the closure properties 
of modules. 

In algebraic number theory the number of congruence classes 
is typically finite, but in algebraic function theory the congruence 
classes form a vector space²² over $\mathbb{C}$. That is, there are algebraic 
functions $\lambda_1, \lambda_2, \ldots, \lambda_m$ such that the functions 

\[ c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m \]

represent the distinct congruence classes (mod $\mathfrak{a}$) as $c_1, c_2, \ldots, c_m$ 
run through the distinct $m$-tuples of complex numbers. (Dedekind 
and Weber say that $\lambda_1, \lambda_2, \ldots, \lambda_m$ form a “complete system of 
remainders.”) 

The same is true when a module $\mathfrak{b}$ is considered relative to 
a module $\mathfrak{a}$: the congruence classes of $\mathfrak{b}$ modulo $\mathfrak{a}$ form a vector 
space. If the functions $\lambda_1, \lambda_2, \ldots, \lambda_m$ are linearly independent over 
$\mathbb{C}$ then they form a basis of this system, which is then written as 
$(\lambda_1, \lambda_2, \ldots, \lambda_m)$. 


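Two functions $\alpha, \beta$ are called congruent modulo the module $\mathfrak{a}$, 

\[ \alpha \equiv \beta \pmod{\mathfrak{a}}, \]

when the difference $\alpha - \beta$ belongs to the module $\mathfrak{a}$. 
The following theorems are immediate from the definition: 


1. If $\alpha \equiv \beta$, $\beta \equiv \gamma \pmod{\mathfrak{a}}$ then $\alpha \equiv \gamma \pmod{\mathfrak{a}}$. 
2. If $\mathfrak{d}$ is any divisor of $\mathfrak{a}$ then $\alpha \equiv \beta \pmod{\mathfrak{a}}$ implies $\alpha \equiv \beta \pmod{\mathfrak{d}}$. 


3. If $\alpha \equiv \beta \pmod{\mathfrak{a}}$ and $\mu$ is any function in $\Omega$, then $\mu\alpha \equiv \mu\beta \pmod{\mu\mathfrak{a}}$, and 
conversely, the former congruence follows from the latter when $\mu$ is nonzero. 


4. If $\alpha \equiv \beta$, $\alpha_1 \equiv \beta_1 \pmod{\mathfrak{a}}$ then $\alpha \pm \alpha_1 \equiv \beta \pm \beta_1 \pmod{\mathfrak{a}}$. 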


If $\lambda_1, \lambda_2, \ldots, \lambda_m$ are given functions in $\Omega$, and $c_1, c_2, \ldots, c_m$ are arbitrary constants, 
then the collection of all functions of the form 

\[ c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m \]

is called a vector space and is denoted by $(\lambda_1, \lambda_2, \ldots, \lambda_m)$. The system of functions 
$\lambda_1, \lambda_2, \ldots, \lambda_m$ is called the basis of the vector space. The functions $\lambda_1, \lambda_2, \ldots, \lambda_m$ 
are said to be linearly independent or a linearly irreducible system, if an equation 
(identity) of the form 

\[ c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m = 0 \]

holds only when the constant coefficients $c_1, c_2, \ldots, c_m$ all vanish. 

We have the theorem that each vector space has a linearly irreducible basis. 
Because if $c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m = 0$ and $c_1$ is nonzero, then the vector space 
$(\lambda_1, \lambda_2, \ldots, \lambda_m)$ is identical with the vector space $(\lambda_2, \lambda_3, \ldots, \lambda_m)$, whose basis contains 
one function less. If the latter is not linearly irreducible, then one can continue 
similarly. Here too we shall understand a “basis” to be an irreducible basis from 
now on. The number of functions in an irreducible basis of a vector space is always 
the same and is called the dimension of the vector space. If the dimension is $m$, 
the vector space is said to be $m$-tuple. Any $m$ functions in an $m$-tuple vector space 
form an irreducible basis if and only if they are linearly independent. 
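The reduction to a linearly irreducible basis can be sketched computationally. The sketch below rests on assumptions that are not in the text: each function is represented by a vector of rational coordinates with respect to some fixed family (for instance, the coefficients of a polynomial in $z$), and the constants are rational rather than complex. The function name `irreducible_basis` is hypothetical. Instead of discarding $\lambda_1$ as in the argument above, this variant keeps a function only when it is independent of those already kept, which yields a basis of the same vector space.

```python
from fractions import Fraction

def irreducible_basis(vectors):
    """Return indices of a linearly independent subfamily spanning the same space.

    Each vector is a list of rational coordinates. A vector is kept only if it
    is not a linear combination of the vectors kept before it.
    """
    reduced = []  # (pivot position, reduced row) for every kept vector
    kept = []
    for idx, vec in enumerate(vectors):
        row = [Fraction(x) for x in vec]
        # Clear the pivot coordinate of each previously reduced row, in order.
        for pivot, prow in reduced:
            if row[pivot] != 0:
                factor = row[pivot] / prow[pivot]
                row = [r - factor * p for r, p in zip(row, prow)]
        # A nonzero remainder means the vector is independent of the kept ones.
        pivot = next((i for i, r in enumerate(row) if r != 0), None)
        if pivot is not None:
            reduced.append((pivot, row))
            kept.append(idx)
    return kept

# The functions 1, z, 1 + z, z^2 as coefficient vectors (constant term first).
family = [[1, 0, 0], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
basis_indices = irreducible_basis(family)   # 1 + z is dropped
dimension = len(basis_indices)
```

In this example the space spanned by the four functions is 3-tuple in the terminology above, with irreducible basis $1, z, z^2$.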


²² Dedekind and Weber call it a Schaar. I have opted for the anachronistic, but accurate, 
translation “vector space” because any 19th-century English equivalent is likely to be misleading. 
(Translator’s note.) 

The functions $\lambda_1, \lambda_2, \ldots, \lambda_m$ are called linearly independent modulo the module 
$\mathfrak{a}$ if a congruence of the form 

\[ c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m \equiv 0 \pmod{\mathfrak{a}} \]

holds only when the constant coefficients $c_1, c_2, \ldots, c_m$ vanish. Then two sums 
of the form $\sum c_\iota\lambda_\iota$ with different values of the constant coefficients $c_\iota$ are always 
incongruent modulo the module $\mathfrak{a}$. 

Now let $\mathfrak{a}$ and $\mathfrak{b}$ be two modules, and suppose first that in $\mathfrak{b}$ only finitely many 
functions $\lambda_1, \lambda_2, \ldots, \lambda_m$ are linearly independent modulo $\mathfrak{a}$. Then each function $\beta$ 
in $\mathfrak{b}$ satisfies exactly one congruence of the form 

\[ \beta \equiv c_1\lambda_1 + c_2\lambda_2 + \cdots + c_m\lambda_m \pmod{\mathfrak{a}} \]

with constant coefficients $c_1, c_2, \ldots, c_m$. The vector space $(\lambda_1, \lambda_2, \ldots, \lambda_m)$ can thus 
be called a complete system of remainders²³ of $\mathfrak{b}$ modulo $\mathfrak{a}$, and $\lambda_1, \lambda_2, \ldots, \lambda_m$ its 
basis. One can write this symbolically as 

\[ \mathfrak{b} \equiv (\lambda_1, \lambda_2, \ldots, \lambda_m) \pmod{\mathfrak{a}}. \]

If one chooses any system of $m$ functions $\lambda'_1, \lambda'_2, \ldots, \lambda'_m$ in $\mathfrak{b}$, then there are $m$ 
congruences 

\[ \lambda'_h \equiv \sum_{\iota} k_{h,\iota}\lambda_\iota \pmod{\mathfrak{a}} \]

with constants $k_{h,\iota}$, and this system is a basis of a complete remainder system of $\mathfrak{b}$ 
modulo $\mathfrak{a}$ if and only if the determinant 

\[ \sum \pm\, k_{1,1}k_{2,2}\cdots k_{m,m} \]

is nonzero. 


§6. The norm of one module relative to another 


SUMMARY AND COMMENTS 


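In algebraic number theory, the norm of a module $\mathfrak{b}$ relative to 
a module $\mathfrak{a}$, $(\mathfrak{b}, \mathfrak{a})$, is simply the number of congruence classes of $\mathfrak{b}$ 
modulo $\mathfrak{a}$. 

In algebraic function theory, as we saw in the previous section, 
the congruence classes of $\mathfrak{b}$ modulo $\mathfrak{a}$ form a vector space 
$(\lambda_1, \lambda_2, \ldots, \lambda_m)$. To define a relative norm $(\mathfrak{b}, \mathfrak{a})$, Dedekind and 
Weber associate a determinant with $(\lambda_1, \lambda_2, \ldots, \lambda_m)$, namely the characteristic 
polynomial of multiplication by $z$ on $\mathfrak{b}/\mathfrak{a}$, which is of degree 
$m$. 

The multiplicative property of determinants then yields a multiplicative 
property of the relative norm: if $\mathfrak{c}$ divides $\mathfrak{b}$ and $\mathfrak{b}$ divides 
$\mathfrak{a}$ then $(\mathfrak{c}, \mathfrak{a}) = (\mathfrak{c}, \mathfrak{b})(\mathfrak{b}, \mathfrak{a})$. 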


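²³ I have translated the German word “Rest” as “remainder” here, rather than the more 
common “residue,” to avoid conflict with the word “residue” that Dedekind and Weber use later, 
inspired by the concept of residue in complex analysis. (Translator’s note.) 

If $(\lambda_1, \lambda_2, \ldots, \lambda_m)$ is a complete remainder system of $\mathfrak{b}$ modulo $\mathfrak{a}$ then, since $z\mathfrak{b}$ 
is divisible by $\mathfrak{b}$, there is a system of $m^2$ constants $c_{\iota,\kappa}$ satisfying the congruences 

\[
\begin{aligned}
z\lambda_1 &\equiv c_{1,1}\lambda_1 + c_{2,1}\lambda_2 + \cdots + c_{m,1}\lambda_m \\
z\lambda_2 &\equiv c_{1,2}\lambda_1 + c_{2,2}\lambda_2 + \cdots + c_{m,2}\lambda_m \\
&\;\;\vdots \\
z\lambda_m &\equiv c_{1,m}\lambda_1 + c_{2,m}\lambda_2 + \cdots + c_{m,m}\lambda_m
\end{aligned}
\pmod{\mathfrak{a}}
\]

and by solving this system one sees that each function $\lambda_\iota$, and hence each function 
$\beta$ in the module $\mathfrak{b}$, may be converted into one in the module $\mathfrak{a}$ by multiplying by 
the $m$th degree polynomial in $z$ 

\[
(\mathfrak{b}, \mathfrak{a}) = (-1)^m
\begin{vmatrix}
c_{1,1} - z & c_{2,1} & \cdots & c_{m,1} \\
c_{1,2} & c_{2,2} - z & \cdots & c_{m,2} \\
\vdots & \vdots & & \vdots \\
c_{1,m} & c_{2,m} & \cdots & c_{m,m} - z
\end{vmatrix}.
\]

It follows easily from the multiplication theorem for determinants that this function 
$(\mathfrak{b}, \mathfrak{a})$ is independent of the choice of basis $\lambda_1, \lambda_2, \ldots, \lambda_m$, and hence dependent only 
on the two modules $\mathfrak{a}, \mathfrak{b}$. We call it the norm of $\mathfrak{a}$ relative to $\mathfrak{b}$. 

If each function in $\mathfrak{b}$ also belongs to $\mathfrak{a}$, so that $\mathfrak{b}$ is divisible by $\mathfrak{a}$, then we set 
$m = 0$ and $(\mathfrak{b}, \mathfrak{a}) = 1$. If, contrary to the assumption above, there are more than 
a finite number of functions in $\mathfrak{b}$ that are linearly independent modulo $\mathfrak{a}$, then it is 
agreed that $(\mathfrak{b}, \mathfrak{a}) = 0$. 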
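The definition can be tried out on the simplest case, chosen here for illustration and not taken from the text: $\mathfrak{b} = [1]$ (all polynomials in $z$) and $\mathfrak{a} = [f]$ for a polynomial $f$ with leading coefficient 1 and degree $m$. Then $1, z, \ldots, z^{m-1}$ is a complete remainder system, and the determinant above should reproduce $f$ itself. The sketch uses rational coefficients instead of complex ones and computes the determinant by the Faddeev-LeVerrier recursion, a standard method that the source does not mention; the names `multiplication_by_z` and `char_poly` are hypothetical.

```python
from fractions import Fraction

def multiplication_by_z(f):
    """Matrix of multiplication by z on polynomials modulo f, in the basis
    1, z, ..., z^(m-1). f = [c_0, ..., c_(m-1), 1] is monic of degree m.
    Row k holds the coordinates of z * z^k reduced modulo f."""
    m = len(f) - 1
    rows = []
    for k in range(m):
        if k < m - 1:
            rows.append([Fraction(int(j == k + 1)) for j in range(m)])
        else:
            rows.append([-Fraction(c) for c in f[:m]])  # z^m = -(c_0 + ... )
    return rows

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def char_poly(A):
    """Coefficients [p_0, ..., p_n] of det(z*I - A), i.e. (-1)^n det(A - z*I),
    computed by the Faddeev-LeVerrier recursion."""
    n = len(A)
    coeffs = [Fraction(0)] * n + [Fraction(1)]
    M = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for k in range(1, n + 1):
        AM = mat_mul(A, M)
        coeffs[n - k] = -sum(AM[i][i] for i in range(n)) / k
        M = [[AM[i][j] + (coeffs[n - k] if i == j else 0) for j in range(n)]
             for i in range(n)]
    return coeffs

f = [2, -3, 1]                       # f = z^2 - 3z + 2
norm = char_poly(multiplication_by_z(f))
assert norm == f                     # the norm ([1], [f]) is f itself
```

For $m = 0$ the recursion returns the constant polynomial 1, in line with the convention $(\mathfrak{b}, \mathfrak{a}) = 1$ stated above.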


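1. If $\mathfrak{m}$ is the least common multiple, and $\mathfrak{d}$ is the greatest common divisor, of 
$\mathfrak{a}$ and $\mathfrak{b}$, then each congruence between functions in $\mathfrak{b}$ modulo $\mathfrak{a}$ is equivalent to the 
same congruence modulo $\mathfrak{m}$. On the other hand, each function in $\mathfrak{b}$ is congruent 
to a function in $\mathfrak{d}$ modulo $\mathfrak{a}$, and conversely each function in $\mathfrak{d}$ is congruent to a 
function in $\mathfrak{b}$ modulo $\mathfrak{a}$. These remarks immediately yield the important theorem 

\[ (\mathfrak{b}, \mathfrak{a}) = (\mathfrak{b}, \mathfrak{m}) = (\mathfrak{d}, \mathfrak{a}), \]

which also remains true when $(\mathfrak{b}, \mathfrak{a}) = 0$. 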


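2. If the module $\mathfrak{a}$ is divisible by the module $\mathfrak{b}$, and $\mathfrak{b}$ is divisible by the third 
module $\mathfrak{c}$, then 

\[ (\mathfrak{c}, \mathfrak{a}) = (\mathfrak{c}, \mathfrak{b})(\mathfrak{b}, \mathfrak{a}). \]

The theorem is obviously correct when one of the two norms $(\mathfrak{c}, \mathfrak{b})$, $(\mathfrak{b}, \mathfrak{a})$ is zero. If 
this is not the case and if 

\[ \mathfrak{c} \equiv (\rho_1, \rho_2, \ldots, \rho_r) \pmod{\mathfrak{b}}, \qquad \mathfrak{b} \equiv (\lambda_1, \lambda_2, \ldots, \lambda_s) \pmod{\mathfrak{a}}, \]

then the functions $\rho_1, \rho_2, \ldots, \rho_r, \lambda_1, \lambda_2, \ldots, \lambda_s$ together are linearly independent 
modulo $\mathfrak{a}$, because if 

\[ \sum c_\iota\rho_\iota + \sum c'_\iota\lambda_\iota \equiv 0 \pmod{\mathfrak{a}}, \]

then it follows, since $\mathfrak{a}$ is divisible by $\mathfrak{b}$ and the functions $\lambda_\iota$ are in $\mathfrak{b}$, that 

\[ \sum c_\iota\rho_\iota \equiv 0 \pmod{\mathfrak{b}}, \text{ hence } c_\iota = 0, \]
\[ \sum c'_\iota\lambda_\iota \equiv 0 \pmod{\mathfrak{a}}, \text{ hence } c'_\iota = 0. \]

Also, since each function $\gamma$ in $\mathfrak{c}$ satisfies a congruence of the form 

\[ \gamma \equiv \sum c_\iota\rho_\iota + \sum c'_\iota\lambda_\iota \pmod{\mathfrak{a}}, \]

the $(r + s)$-tuple vector space $(\rho_1, \rho_2, \ldots, \rho_r, \lambda_1, \lambda_2, \ldots, \lambda_s)$ is a complete remainder 
system of $\mathfrak{c}$ modulo $\mathfrak{a}$, or 

\[ \mathfrak{c} \equiv (\rho_1, \rho_2, \ldots, \rho_r, \lambda_1, \lambda_2, \ldots, \lambda_s) \pmod{\mathfrak{a}}. \]

Therefore if 

\[
\begin{aligned}
z\rho_1 &= e_{1,1}\rho_1 + \cdots + e_{r,1}\rho_r + \beta_1, \\
&\;\;\vdots \\
z\rho_r &= e_{1,r}\rho_1 + \cdots + e_{r,r}\rho_r + \beta_r,
\end{aligned}
\]

where the $e_{\iota,\kappa}$ are constants and the $\beta_\iota$ are functions in $\mathfrak{b}$, so that 

\[
(\mathfrak{c}, \mathfrak{b}) = (-1)^r
\begin{vmatrix}
e_{1,1} - z & \cdots & e_{r,1} \\
\vdots & & \vdots \\
e_{1,r} & \cdots & e_{r,r} - z
\end{vmatrix},
\]

and if also 

\[
\begin{aligned}
\beta_1 &\equiv h_{1,1}\lambda_1 + \cdots + h_{s,1}\lambda_s \\
&\;\;\vdots \\
\beta_r &\equiv h_{1,r}\lambda_1 + \cdots + h_{s,r}\lambda_s \\
z\lambda_1 &\equiv c_{1,1}\lambda_1 + \cdots + c_{s,1}\lambda_s \\
&\;\;\vdots \\
z\lambda_s &\equiv c_{1,s}\lambda_1 + \cdots + c_{s,s}\lambda_s
\end{aligned}
\pmod{\mathfrak{a}}
\]

with constant coefficients $h_{\iota,\kappa}$, $c_{\iota,\kappa}$, so that 

\[
(\mathfrak{b}, \mathfrak{a}) = (-1)^s
\begin{vmatrix}
c_{1,1} - z & \cdots & c_{s,1} \\
\vdots & & \vdots \\
c_{1,s} & \cdots & c_{s,s} - z
\end{vmatrix},
\]

then it follows that 

\[
\begin{aligned}
z\rho_1 &\equiv e_{1,1}\rho_1 + \cdots + e_{r,1}\rho_r + h_{1,1}\lambda_1 + \cdots + h_{s,1}\lambda_s \\
&\;\;\vdots \\
z\rho_r &\equiv e_{1,r}\rho_1 + \cdots + e_{r,r}\rho_r + h_{1,r}\lambda_1 + \cdots + h_{s,r}\lambda_s \\
z\lambda_1 &\equiv c_{1,1}\lambda_1 + \cdots + c_{s,1}\lambda_s \\
&\;\;\vdots \\
z\lambda_s &\equiv c_{1,s}\lambda_1 + \cdots + c_{s,s}\lambda_s
\end{aligned}
\pmod{\mathfrak{a}}
\]

whence 

\[
(\mathfrak{c}, \mathfrak{a}) = (-1)^{r+s}
\begin{vmatrix}
e_{1,1} - z & \cdots & e_{r,1} & h_{1,1} & \cdots & h_{s,1} \\
\vdots & & \vdots & \vdots & & \vdots \\
e_{1,r} & \cdots & e_{r,r} - z & h_{1,r} & \cdots & h_{s,r} \\
0 & \cdots & 0 & c_{1,1} - z & \cdots & c_{s,1} \\
\vdots & & \vdots & \vdots & & \vdots \\
0 & \cdots & 0 & c_{1,s} & \cdots & c_{s,s} - z
\end{vmatrix}
= (\mathfrak{c}, \mathfrak{b})(\mathfrak{b}, \mathfrak{a}).
\]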


3. When the basis functions $\beta_1, \beta_2, \ldots, \beta_s$ of a finitely generated module $\mathfrak{b} = 
[\beta_1, \beta_2, \ldots, \beta_s]$ can be converted to functions in a module $\mathfrak{a}$ by multiplication by 
nonzero polynomials in $z$, the norm $(\mathfrak{b}, \mathfrak{a})$ of $\mathfrak{a}$ relative to $\mathfrak{b}$ is nonzero. At the same 
time, the least common multiple of $\mathfrak{a}$ and $\mathfrak{b}$ is a finitely generated module $\mathfrak{m}$, an 
irreducible basis of which is 

\[
\begin{aligned}
\mu_1 &= a_{1,1}\beta_1, \\
\mu_2 &= a_{1,2}\beta_1 + a_{2,2}\beta_2, \\
&\;\;\vdots \\
\mu_s &= a_{1,s}\beta_1 + a_{2,s}\beta_2 + \cdots + a_{s,s}\beta_s,
\end{aligned}
\]

where the coefficients $a_{\iota,\kappa}$ are polynomial functions of $z$, and indeed such that 

\[ (\mathfrak{b}, \mathfrak{a}) = a_{1,1}a_{2,2}\cdots a_{s,s}. \]
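To prove this important theorem we let $\mathfrak{a}_1$ be the greatest common divisor of 
$\mathfrak{a}$ and $[\beta_1]$, let $\mathfrak{a}_2$ be the greatest common divisor of $\mathfrak{a}_1$ and $[\beta_2]$, etc., so that $\mathfrak{a}_r$ is 
the collection of all functions of the form 

\[ \alpha_r = \alpha + y_1\beta_1 + \cdots + y_r\beta_r \]

where $\alpha$ is a function in $\mathfrak{a}$ and $y_1, \ldots, y_r$ are polynomial functions of $z$. Then $\mathfrak{a}_s$ is 
the greatest common divisor of $\mathfrak{a}$ and $\mathfrak{b}$. Now since each module $\mathfrak{a}_r$ is divisible by 
the next, $\mathfrak{a}_{r+1}$, it follows from 1 and 2 that 

\[ (\mathfrak{b}, \mathfrak{a}) = (\mathfrak{a}_s, \mathfrak{a}) = (\mathfrak{a}_s, \mathfrak{a}_{s-1})(\mathfrak{a}_{s-1}, \mathfrak{a}_{s-2})\cdots(\mathfrak{a}_1, \mathfrak{a}), \]

and hence it is a matter of determining $(\mathfrak{a}_r, \mathfrak{a}_{r-1})$. But 

\[ \alpha_r = \alpha_{r-1} + y_r\beta_r \equiv y_r\beta_r \pmod{\mathfrak{a}_{r-1}}, \]

and by hypothesis there is a nonzero polynomial function $x_r$ of $z$ for which 

\[ x_r\beta_r \equiv 0 \pmod{\mathfrak{a}} \]

and hence also 

\[ x_r\beta_r \equiv 0 \pmod{\mathfrak{a}_{r-1}}. \]

Now if $a_{r,r}$ is a function of minimal degree $m_r$ among all the functions $x_r$ satisfying 
the latter congruence, and also such that the coefficient of the highest power of $z$ is 
1, then all other functions $x_r$ satisfying this congruence are divisible by $a_{r,r}$. This 
is because 

\[ (x_r - q\,a_{r,r})\beta_r \equiv 0 \pmod{\mathfrak{a}_{r-1}} \]

for any polynomial $q$, hence if $x_r$ is not divisible by $a_{r,r}$, $q$ may be chosen so that 
$x_r - q\,a_{r,r}$ is of lower degree than $a_{r,r}$, contrary to hypothesis. 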
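Thus, if one sets 

\[ y_r = q\,a_{r,r} + b_{r,r} \]

and determines $q$ so that the degree of $b_{r,r}$ is smaller than $m_r$, it follows that 

\[ \alpha_r \equiv b_{r,r}\beta_r \pmod{\mathfrak{a}_{r-1}}, \]

whence 

\[ \mathfrak{a}_r \equiv (\beta_r, z\beta_r, z^2\beta_r, \ldots, z^{m_r-1}\beta_r) \pmod{\mathfrak{a}_{r-1}}. \]

Consequently, when one for the moment sets 

\[ a_{r,r} = c_0 + c_1z + \cdots + c_{m_r-1}z^{m_r-1} + z^{m_r}, \qquad \lambda_k = z^{k-1}\beta_r, \]

it follows that 

\[ z\lambda_1 = \lambda_2, \quad z\lambda_2 = \lambda_3, \quad \ldots, \]
\[ z\lambda_{m_r} \equiv -c_0\lambda_1 - c_1\lambda_2 - \cdots - c_{m_r-1}\lambda_{m_r} \pmod{\mathfrak{a}_{r-1}}, \]

hence 

\[
(\mathfrak{a}_r, \mathfrak{a}_{r-1}) = (-1)^{m_r}
\begin{vmatrix}
-z & 1 & 0 & \cdots & 0 \\
0 & -z & 1 & \cdots & 0 \\
\vdots & \vdots & \vdots & & \vdots \\
0 & 0 & 0 & \cdots & 1 \\
-c_0 & -c_1 & -c_2 & \cdots & -c_{m_r-1} - z
\end{vmatrix}
= a_{r,r}.
\]

It follows that 

\[
\begin{array}{llll}
\beta_1, & z\beta_1, & \ldots, & z^{m_1-1}\beta_1, \\
\beta_2, & z\beta_2, & \ldots, & z^{m_2-1}\beta_2, \\
\;\vdots & & & \\
\beta_s, & z\beta_s, & \ldots, & z^{m_s-1}\beta_s
\end{array}
\]

is a basis for a complete remainder system of $\mathfrak{b}$ modulo $\mathfrak{a}$, and that 

\[ (\mathfrak{b}, \mathfrak{a}) = a_{1,1}a_{2,2}\cdots a_{s,s} \]

is of degree $m = m_1 + m_2 + \cdots + m_s$. 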
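Now, since $a_{r,r}\beta_r \equiv 0 \pmod{\mathfrak{a}_{r-1}}$, a function $\mu_r$ in $\mathfrak{a}$ and polynomial functions 
$a_{\kappa,r}$ may be determined so that 

\[ \mu_r = a_{1,r}\beta_1 + a_{2,r}\beta_2 + \cdots + a_{r,r}\beta_r. \]

The functions 

\[
\begin{aligned}
\mu_1 &= a_{1,1}\beta_1, \\
\mu_2 &= a_{1,2}\beta_1 + a_{2,2}\beta_2, \\
&\;\;\vdots \\
\mu_s &= a_{1,s}\beta_1 + a_{2,s}\beta_2 + \cdots + a_{s,s}\beta_s
\end{aligned}
\]

determined in this way are, since none of the functions $a_{1,1}, \ldots, a_{s,s}$ vanishes, rationally 
independent. At the same time, they are all in both $\mathfrak{a}$ and $\mathfrak{b}$, hence also in 
the least common multiple $\mathfrak{m}$ of these two modules. It remains to prove that they 
form a basis of $\mathfrak{m}$. 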


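Let $\mathfrak{m}_r$ be the least common multiple of $\mathfrak{a}$ and $[\beta_1, \beta_2, \ldots, \beta_r]$, $\mathfrak{m}_s = \mathfrak{m}$, so that 
each of the modules $\mathfrak{m}_1, \mathfrak{m}_2, \ldots, \mathfrak{m}_s$ is divisible by the next, and hence also by $\mathfrak{m}$, 
and let 

\[ \nu_r = z_1\beta_1 + z_2\beta_2 + \cdots + z_r\beta_r \]

be a function in $\mathfrak{m}_r$, hence also in $\mathfrak{a}$. 
This means 

\[ z_r\beta_r \equiv 0 \pmod{\mathfrak{a}_{r-1}}, \]

hence 

\[ z_r = x_r a_{r,r}, \]

where $x_r$ is a polynomial function. Therefore 

\[ \nu_r - x_r\mu_r \equiv 0 \pmod{\mathfrak{m}_{r-1}}, \qquad \nu_1 - x_1\mu_1 = 0, \]

whence 

\[ \nu_r = x_1\mu_1 + x_2\mu_2 + \cdots + x_r\mu_r, \]

and so 

\[ \mathfrak{m}_r = [\mu_1, \mu_2, \ldots, \mu_r], \qquad \mathfrak{m} = [\mu_1, \mu_2, \ldots, \mu_s]. \]

Q.E.D. 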
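Thus an irreducible basis of the module $\mathfrak{m}$ contains precisely as many functions 
as an irreducible basis of $\mathfrak{b}$. If one chooses another basis $\mu'_1, \mu'_2, \ldots, \mu'_s$ in place of 
$\mu_1, \mu_2, \ldots, \mu_s$, then $\mu'_1, \mu'_2, \ldots, \mu'_s$ may be expressed in the form 

\[ \mu'_\kappa = a'_{1,\kappa}\beta_1 + a'_{2,\kappa}\beta_2 + \cdots + a'_{s,\kappa}\beta_s \]

with polynomial coefficients $a'_{\iota,\kappa}$, and it follows from §4, 2 that 

\[ (\mathfrak{b}, \mathfrak{a}) = \text{const.} \sum \pm\, a'_{1,1}a'_{2,2}\cdots a'_{s,s}. \]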

4. In particular, if we make the assumption that $\mathfrak{a}$ is likewise a finitely generated 
module, possessing an irreducible basis with the same number of functions as the 
basis of $\mathfrak{b}$, and if $\mathfrak{a}$ is divisible by $\mathfrak{b}$ as well, and 

\[ \mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_s], \]

then the polynomial functions $b_{\iota,\kappa}$ may be determined in such a way that 

\[ \alpha_\kappa = b_{1,\kappa}\beta_1 + b_{2,\kappa}\beta_2 + \cdots + b_{s,\kappa}\beta_s. \]

Then the hypothesis of 3, that the functions $\beta_\iota$ are convertible to functions in $\mathfrak{a}$ by 
multiplication by polynomial functions of $z$, is satisfied, as one sees by solution of 
this equation system. At the same time, $\mathfrak{a}$ is itself the least common multiple of $\mathfrak{a}$ 
and $\mathfrak{b}$, and therefore 

\[ (\mathfrak{b}, \mathfrak{a}) = \text{const.} \sum \pm\, b_{1,1}b_{2,2}\cdots b_{s,s}. \]


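5. If $\mathfrak{m}$ is the least common multiple of two modules $\mathfrak{a}, \mathfrak{b}$ and if $\nu$ is an arbitrary 
function in $\Omega$ then, as follows easily from the definition, $\nu\mathfrak{m}$ is the least common 
multiple of $\nu\mathfrak{a}$ and $\nu\mathfrak{b}$. If $(\mathfrak{b}, \mathfrak{a}) = 0$ then $(\nu\mathfrak{b}, \nu\mathfrak{a}) = 0$ also. But if $(\mathfrak{b}, \mathfrak{a})$ and $\nu$ are 
nonzero, one gets 

\[ (\nu\mathfrak{b}, \nu\mathfrak{a}) = (\mathfrak{b}, \mathfrak{a}) \]

when the basis functions $\mu_\iota, \beta_\iota$ of $\mathfrak{m}$ and $\mathfrak{b}$ in 3 are replaced by $\nu\mu_\iota, \nu\beta_\iota$. 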


§7. The ideals in o 


SUMMARY AND COMMENTS 


An ideal in the ring $\mathfrak{o}$ of integral algebraic functions is defined, 
as usual, as a set closed under sum and difference, and under products 
with arbitrary members of $\mathfrak{o}$. As with modules in general, “to 
divide is to contain,” so $\mathfrak{o}$ itself divides every ideal. Also as usual, 
a principal ideal is one consisting of all the multiples of a fixed 
member of $\mathfrak{o}$. 

The norm $N(\mathfrak{a})$ of an ideal $\mathfrak{a}$ is the relative norm $(\mathfrak{o}, \mathfrak{a})$. As 
shown in the previous section, $N(\mathfrak{a})$ is a polynomial, and the degree 
of this polynomial is called the degree of $\mathfrak{a}$. $N(\mathfrak{a}) = 1$ if and only 
if $\mathfrak{a} = \mathfrak{o}$, and if $\alpha \in \mathfrak{a}$ then $N(\mathfrak{a})$ divides $N(\alpha)$. 

Congruences modulo an ideal $\mathfrak{a}$ have the following convenient 
property, which does not hold for modules in general: $\mu \equiv \mu_1$, 
$\nu \equiv \nu_1 \pmod{\mathfrak{a}}$ implies $\mu\nu \equiv \mu_1\nu_1 \pmod{\mathfrak{a}}$. 


A system $\mathfrak{a}$ of integral algebraic functions of $z$ in the field $\Omega$ is called an ideal 
when it satisfies the following conditions: 


I. The sum and difference of any two functions in $\mathfrak{a}$ are again functions in $\mathfrak{a}$. 
II. The product of any function in $\mathfrak{a}$ with any function in $\mathfrak{o}$ (§3) is again a 
function in $\mathfrak{a}$. 


Every ideal is at the same time a module, and all terms and notations defined for 
modules can be applied to ideals. 

The module $\mathfrak{o}$ (the system of all integral algebraic functions of $z$) is itself an 
ideal, and each ideal is divisible by $\mathfrak{o}$. Likewise, when $\mu$ is any nonzero function in 
$\mathfrak{o}$, the module $\mathfrak{o}\mu$ (the system of all integral algebraic functions divisible by $\mu$) is an 
ideal. Such an ideal is called a principal ideal. If $\omega_1, \omega_2, \ldots, \omega_n$ is a basis of $\mathfrak{o}$ then 

\[ \mathfrak{o}\mu = [\omega_1\mu, \omega_2\mu, \ldots, \omega_n\mu] \]

and $\mathfrak{o}\mu$ is the least common multiple of $\mathfrak{o}$ and $\mathfrak{o}\mu$. It then follows from §6, 4 and 
definition (4) of §2 that 


(1) $(\mathfrak{o}, \mathfrak{o}\mu) = \text{const.}\, N(\mu)$, 


and consequently it is nonzero. 
If $\mathfrak{a}$ is any ideal and $\alpha$ is an arbitrary function in $\mathfrak{a}$ then (by II) the principal 
ideal $\mathfrak{o}\alpha$ is divisible by $\mathfrak{a}$. It follows by §6, 2 that 


(2) $(\mathfrak{o}, \mathfrak{o}\alpha) = (\mathfrak{o}, \mathfrak{a})(\mathfrak{a}, \mathfrak{o}\alpha)$ 


and hence $(\mathfrak{o}, \mathfrak{a})$ is also nonzero. Now since $\mathfrak{a}$ is again the least common multiple of 
$\mathfrak{a}$ and $\mathfrak{o}$, it follows by §6, 3 that $\mathfrak{a}$ has an irreducible basis consisting of $n$ integral 
algebraic functions $\alpha_1, \alpha_2, \ldots, \alpha_n$, which therefore also form a basis of the field $\Omega$. 

The norm of $\mathfrak{a}$ relative to $\mathfrak{o}$, i.e., the polynomial function $(\mathfrak{o}, \mathfrak{a})$ of $z$, will be 
called the norm of the ideal $\mathfrak{a}$ and denoted by $N(\mathfrak{a})$. The degree of this polynomial 
is called the degree of the ideal $\mathfrak{a}$. 


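If 

\[ \mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_n], \qquad \mathfrak{o} = [\omega_1, \omega_2, \ldots, \omega_n] \]

and 

\[
\begin{aligned}
\alpha_1 &= a_{1,1}\omega_1 + a_{2,1}\omega_2 + \cdots + a_{n,1}\omega_n, \\
\alpha_2 &= a_{1,2}\omega_1 + a_{2,2}\omega_2 + \cdots + a_{n,2}\omega_n, \\
&\;\;\vdots \\
\alpha_n &= a_{1,n}\omega_1 + a_{2,n}\omega_2 + \cdots + a_{n,n}\omega_n
\end{aligned}
\]

with polynomial coefficients $a_{\iota,\kappa}$, then it follows from §6, 4 that 


(3) $N(\mathfrak{a}) = \text{const.} \sum \pm\, a_{1,1}a_{2,2}\cdots a_{n,n}$. 

Since each function in $\mathfrak{o}$, and hence also the function “1,” is converted into a function 
in the ideal $\mathfrak{a}$ by multiplication by $N(\mathfrak{a})$, $N(\mathfrak{a})$ is always a function in $\mathfrak{a}$. 
The norm of the ideal $\mathfrak{o}$ is equal to 1, and conversely $\mathfrak{o}$ is the only ideal with this 
property. Also, $\mathfrak{o}$ is the only ideal that contains the function “1” (or any constant). 
If $\alpha$ is a function in $\mathfrak{a}$ then it follows from (1), (2), (3) that 


(4) $N(\alpha) = \text{const.}\, N(\mathfrak{a})(\mathfrak{a}, \mathfrak{o}\alpha)$, 


i.e., the norm of each function in $\mathfrak{a}$ is divisible by the norm of $\mathfrak{a}$. 

Congruences modulo an ideal satisfy the following theorem, which distinguishes 
ideals from general modules. 

If $\mu, \mu_1, \nu, \nu_1$ are functions in $\mathfrak{o}$ that satisfy the congruences 

\[ \mu \equiv \mu_1, \quad \nu \equiv \nu_1 \pmod{\mathfrak{a}}, \]

then also 

\[ \mu\nu \equiv \mu_1\nu_1 \pmod{\mathfrak{a}}. \]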
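The text states this theorem without proof. One way to see it, offered here as a short sketch using only conditions I and II, is to split the difference of the two products:

```latex
% \mu - \mu_1 and \nu - \nu_1 lie in \mathfrak{a}; \nu and \mu_1 lie in \mathfrak{o}.
\mu\nu - \mu_1\nu_1 = (\mu - \mu_1)\,\nu + \mu_1\,(\nu - \nu_1)
```

Each summand is the product of a function in $\mathfrak{a}$ with a function in $\mathfrak{o}$, so it lies in $\mathfrak{a}$ by II, and the sum lies in $\mathfrak{a}$ by I. For a general module the step using II is not available, which is why the property is special to ideals.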


§8. Multiplication and division of ideals 


SUMMARY AND COMMENTS 


Multiplication and division of ideals follow from multiplication 
and division of modules in §4, but with extra features due to the 
existence of principal ideals, and with a specific focus on prime 
ideals. As in algebraic number theory, the purpose of prime ideals 
is to recapture the unique prime factorization property that holds 
for the “ordinary” integral elements (ordinary integers in the case 
of algebraic number theory and polynomials in the case of algebraic 
function theory). 

Also involved in this theory are the least common multiple, $\mathfrak{m}$, 
and the greatest common divisor, $\mathfrak{d}$, of ideals $\mathfrak{a}$ and $\mathfrak{b}$. Their norms 
are shown to satisfy 

\[ N(\mathfrak{a})N(\mathfrak{b}) = N(\mathfrak{m})N(\mathfrak{d}). \]

It does not yet follow that the norm is multiplicative, but only that 

\[ N(\mathfrak{ab}) = N(\mathfrak{a})N(\mathfrak{b}) \]

for relatively prime $\mathfrak{a}$ and $\mathfrak{b}$. 
However, it is shown that if $\mathfrak{b}$ divides $\mathfrak{a}$ then $N(\mathfrak{b})$ divides $N(\mathfrak{a})$. 
It follows, by splitting an ideal into factors of minimal (degree) norm, 
that a prime factorization exists for each ideal. 
The main step towards proving uniqueness of this prime factorization 
is also taken in this section. It is the prime divisor property: 
if a prime ideal $\mathfrak{p}$ divides $\mathfrak{ab}$, then $\mathfrak{p}$ divides $\mathfrak{a}$ or $\mathfrak{p}$ divides $\mathfrak{b}$. 


The basic properties I and II of ideals, together with the concepts of §4, lead 
first of all to: 


1. The least common multiple, greatest common divisor, and the product of two
(or arbitrarily many) ideals are themselves ideals. Likewise, when $\nu$ is a function
in $\mathfrak{o}$, and $\mathfrak{a}$ is an ideal, the product $\mathfrak{a}\nu$ is an ideal.

2. The product of ideals is divisible by each of its factors, and

$\mathfrak{a}\mathfrak{o} = \mathfrak{a}$

for each ideal $\mathfrak{a}$. This is because I, II imply that each function in $\mathfrak{a}\mathfrak{o}$ is also a
function in $\mathfrak{a}$ and, since $\mathfrak{o}$ contains the function “1,” it is true conversely that each
function in $\mathfrak{a}$ is also a function in $\mathfrak{a}\mathfrak{o}$.


3. A principal ideal $\mathfrak{o}\mu$ is divisible by a principal ideal $\mathfrak{o}\nu$ if and only if the
integral algebraic function $\mu$ is divisible by the integral algebraic function $\nu$.


We now add the following definitions: 


4. Definition. A function $\alpha$ in $\mathfrak{o}$ is said to be divisible by the ideal $\mathfrak{a}$ when the
principal ideal $\mathfrak{o}\alpha$ is divisible by $\mathfrak{a}$, i.e., when $\alpha$ is a function in $\mathfrak{a}$.


5. Definition. Two ideals $\mathfrak{a}, \mathfrak{b}$ are called relatively prime when their greatest
common divisor is $\mathfrak{o}$. The necessary and sufficient condition for this is that there
exist a function $\alpha$ in $\mathfrak{a}$ and a function $\beta$ in $\mathfrak{b}$ such that

$\alpha + \beta = 1$,

or in other words that there exists a function $\alpha$ in $\mathfrak{a}$ satisfying the congruence
$\alpha \equiv 1 \pmod{\mathfrak{b}}$, or a function $\beta$ in $\mathfrak{b}$ satisfying the congruence $\beta \equiv 1 \pmod{\mathfrak{a}}$.


6. Definition. An ideal p different from o is called a prime ideal when no ideal 
apart from p and o divides p. 

On the basis of these definitions, we now obtain the following theorems on 
divisibility of ideals. 


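7. If $\mathfrak{a}, \mathfrak{b}$ are two ideals with least common multiple $\mathfrak{m}$ and greatest common
divisor $\mathfrak{d}$, then it follows from §6, 1, 2 that

$N(\mathfrak{m}) = N(\mathfrak{b})(\mathfrak{b}, \mathfrak{m}) = N(\mathfrak{b})(\mathfrak{b}, \mathfrak{a})$,
$N(\mathfrak{a}) = N(\mathfrak{d})(\mathfrak{d}, \mathfrak{a}) = N(\mathfrak{d})(\mathfrak{b}, \mathfrak{a})$.

Consequently $(\mathfrak{b}, \mathfrak{a})$ is nonzero and

$N(\mathfrak{a})N(\mathfrak{b}) = N(\mathfrak{m})N(\mathfrak{d})$.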
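A hedged sanity check of this relation in the simplest situation, which is an assumption made here for illustration and not a case singled out in the text: take $n = 1$, so that $\Omega$ is the field of rational functions of $z$, $\mathfrak{o}$ is the ring of polynomials in $z$, every ideal is principal, and the norm of $\mathfrak{o}\mu$ is $\mu$ up to a constant factor.

```latex
\mathfrak{a} = \mathfrak{o}\,z^2(z-1), \qquad \mathfrak{b} = \mathfrak{o}\,z(z-2), \\
\mathfrak{m} = \mathfrak{o}\,z^2(z-1)(z-2), \qquad \mathfrak{d} = \mathfrak{o}\,z, \\
N(\mathfrak{a})N(\mathfrak{b}) = z^3(z-1)(z-2) = N(\mathfrak{m})N(\mathfrak{d}).
```

Here the relation reduces to the familiar fact that the product of two polynomials equals the product of their least common multiple and greatest common divisor, up to a constant.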
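8. If the ideal $\mathfrak{a}$ is divisible by the ideal $\mathfrak{b}$ then, by §6, 2,

$N(\mathfrak{a}) = (\mathfrak{b}, \mathfrak{a})\,N(\mathfrak{b})$,

hence $N(\mathfrak{a})$ is divisible by $N(\mathfrak{b})$.

In particular, if $(\mathfrak{b}, \mathfrak{a}) = 1$, then $\mathfrak{b}$ is also divisible by $\mathfrak{a}$ and it follows that:

9. If $\mathfrak{a}$ is divisible by $\mathfrak{b}$ and at the same time $N(\mathfrak{a}) = N(\mathfrak{b})$, then $\mathfrak{a} = \mathfrak{b}$, i.e.,
the two ideals are identical.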

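10. If $\mathfrak{a}$ is divisible by $\mathfrak{a}_1$, and $\mathfrak{b}$ by $\mathfrak{b}_1$, then $\mathfrak{a}\mathfrak{b}$ is divisible by $\mathfrak{a}_1\mathfrak{b}_1$ (§4, 7).

11. If an ideal $\mathfrak{a}$ is divisible by a principal ideal $\mathfrak{o}\mu$, then all functions in $\mathfrak{a}$ are
of the form $\beta\mu$, and the collection of all the functions $\beta$ is again an ideal, $\mathfrak{b}$, so that
one can set

$\mathfrak{a} = \mu\mathfrak{b}$.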


12. If $\mu$ is an arbitrary nonzero function in $\mathfrak{o}$ and the ideal $\mathfrak{a}\mu$ is divisible by
the ideal $\mathfrak{b}\mu$, then $\mathfrak{a}$ is divisible by $\mathfrak{b}$, and $\mathfrak{a}\mu = \mathfrak{b}\mu$ implies $\mathfrak{a} = \mathfrak{b}$.

13. The least common multiple of two ideals $\mathfrak{a}, \mathfrak{o}\nu$, one of which is a principal
ideal, has the form $\mathfrak{r}\nu$ by 11, where $\mathfrak{r}$ is an ideal. Since, on the other hand, $\mathfrak{a}\nu$ is a
common multiple of $\mathfrak{a}$ and $\mathfrak{o}\nu$, and hence divisible by $\mathfrak{r}\nu$, it follows by 12 that $\mathfrak{r}$ is
a divisor of $\mathfrak{a}$.


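14. If $\mathfrak{a}$ is an ideal and $\nu$ is a function in $\mathfrak{o}$ then, by §6, 2, 5:

$(\mathfrak{o}, \mathfrak{a}\nu) = (\mathfrak{o}, \mathfrak{o}\nu)(\mathfrak{o}\nu, \mathfrak{a}\nu) = (\mathfrak{o}, \mathfrak{o}\nu)(\mathfrak{o}, \mathfrak{a})$,

hence

$N(\mathfrak{a}\nu) = \mathrm{const.}\,N(\mathfrak{a})N(\nu)$.

Thus if $\mathfrak{r}\nu$ is the least common multiple, and $\mathfrak{d}$ the greatest common divisor, of the
two ideals $\mathfrak{a}, \mathfrak{o}\nu$, it follows from 7 that

$N(\mathfrak{a}) = N(\mathfrak{r})N(\mathfrak{d})$.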


15. Each ideal a different from o is divisible by a prime ideal p. 

If $\mathfrak{a}$ is not a prime ideal, then it has at least one strict divisor different from $\mathfrak{o}$,
and among these let $\mathfrak{p}$ be one whose norm has minimal degree. The latter can have
no strict divisor $\mathfrak{p}'$ different from $\mathfrak{o}$, otherwise $\mathfrak{p}'$ would also be a divisor of $\mathfrak{a}$ and
(by 8) $N(\mathfrak{p}')$ would have lower degree than $N(\mathfrak{p})$. This contradicts the hypothesis
on $\mathfrak{p}$, and hence $\mathfrak{p}$ is a prime ideal.


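16. If $\mathfrak{a}$ is relatively prime to $\mathfrak{b}$ then $\mathfrak{a}\mathfrak{b}$ is the least common multiple of $\mathfrak{a}$ and
$\mathfrak{b}$, and hence each ideal divisible by both $\mathfrak{a}$ and $\mathfrak{b}$ is also divisible by the product
$\mathfrak{a}\mathfrak{b}$.

For, by hypothesis, there are functions $\alpha_1, \beta_1$ in $\mathfrak{a}, \mathfrak{b}$ respectively such that

$\alpha_1 + \beta_1 = 1$,

by 5. On the other hand, if $\alpha = \beta$ is a function in the least common multiple $\mathfrak{m}$
of $\mathfrak{a}$ and $\mathfrak{b}$, then the above equation gives

$\alpha = \beta = \alpha_1\beta + \alpha\beta_1$,

showing it to be a function in $\mathfrak{a}\mathfrak{b}$. This means that $\mathfrak{m}$ is divisible by $\mathfrak{a}\mathfrak{b}$, and since,
conversely, (by 2) $\mathfrak{a}\mathfrak{b}$ is divisible by $\mathfrak{m}$, $\mathfrak{m}$ is identical to $\mathfrak{a}\mathfrak{b}$, and it follows from 7
that in this case

$N(\mathfrak{a}\mathfrak{b}) = N(\mathfrak{a})N(\mathfrak{b})$.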
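17. If $\mathfrak{a}$ is an arbitrary ideal, and $\mathfrak{p}$ is a prime ideal, then either $\mathfrak{a}$ is divisible
by $\mathfrak{p}$ or $\mathfrak{a}$ is relatively prime to $\mathfrak{p}$. This is because, there being no divisors of $\mathfrak{p}$ other than
$\mathfrak{o}$ and $\mathfrak{p}$, the greatest common divisor of $\mathfrak{a}$ and $\mathfrak{p}$ can be nothing but $\mathfrak{o}$ or $\mathfrak{p}$.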


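18. If $\mathfrak{a}$ is relatively prime to $\mathfrak{b}$ and to $\mathfrak{c}$, then $\mathfrak{a}$ is also relatively prime to $\mathfrak{b}\mathfrak{c}$.
By hypothesis there are (by 5) functions $\beta, \gamma$ in $\mathfrak{b}, \mathfrak{c}$ satisfying the congruences

$\beta \equiv 1, \quad \gamma \equiv 1 \pmod{\mathfrak{a}}$,

hence by §7

$\beta\gamma \equiv 1 \pmod{\mathfrak{a}}$.

Since $\beta\gamma$ is in $\mathfrak{b}\mathfrak{c}$, this proves the assertion.

It also follows that, when the product $\mathfrak{a}\mathfrak{b}$ is divisible by a prime ideal $\mathfrak{p}$, at least
one of the two factors $\mathfrak{a}, \mathfrak{b}$ must be divisible by $\mathfrak{p}$, and, applied to principal ideals,
this means that when the product $\mu\nu$ of two integral algebraic functions is in $\mathfrak{p}$ at
least one of the two factors $\mu, \nu$ must be in $\mathfrak{p}$.

19. If $\mathfrak{a}$ is relatively prime to $\mathfrak{c}$ and $\mathfrak{a}\mathfrak{b}$ is divisible by $\mathfrak{c}$, then $\mathfrak{b}$ is divisible by $\mathfrak{c}$.
By hypothesis there is a function $\alpha$ in $\mathfrak{a}$ satisfying the congruence

$\alpha \equiv 1 \pmod{\mathfrak{c}}$.

It follows that if $\beta$ is any function in $\mathfrak{b}$ then

$\beta \equiv \alpha\beta$, which is $\equiv 0 \pmod{\mathfrak{c}}$

by hypothesis. Consequently, $\beta$ is in $\mathfrak{c}$, and hence $\mathfrak{b}$ is divisible by $\mathfrak{c}$.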


§9. Laws of divisibility of ideals 


SUMMARY AND COMMENTS 


In ordinary arithmetic, the prime divisor property, which was 
proved by Euclid, is only a short step away from unique prime fac- 
torization. In algebraic number theory this is not the case (see 
Dedekind (1877), §§22, 23), because if $\mathfrak{p}$ divides $\mathfrak{a}$ it does not easily
follow that $\mathfrak{a} = \mathfrak{p}\mathfrak{c}$ for some $\mathfrak{c}$. However, in algebraic function theory
it is only a short step, carried out in this section. 

The key fact, proved in subsection 4, is that if an ideal $\mathfrak{a}$ divides
an ideal $\mathfrak{c}$ then $\mathfrak{c} = \mathfrak{a}\mathfrak{b}$ for some ideal $\mathfrak{b}$. This fact is obtainable more
simply for algebraic functions than for algebraic numbers, thanks 
to some advantages that polynomial functions have over ordinary 
integers, notably their factorization into linear factors. 

A useful consequence of unique prime factorization is the
multiplicative property of the norm, $N(\mathfrak{a}\mathfrak{b}) = N(\mathfrak{a})N(\mathfrak{b})$, obtained in
§8 only for relatively prime $\mathfrak{a}$ and $\mathfrak{b}$. The general case follows
immediately from this special case once we have a unique prime ideal
factorization.

Another important way in which the theory of prime ideals is 
simpler in the function field case is the following: the prime ideals 
are precisely those of first degree (see below, subsection 7). This 
is due to the fact that each polynomial splits into linear factors, 
thanks to the fundamental theorem of algebra. 


All these theorems, which are mostly immediate from the definition of ideal, do 
not suffice to prove the complete analogy between the laws of divisibility of ideals 
and those of polynomial functions. Our proof of this will be based on the following 
theorem: 


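1. If $\mathfrak{a}$ is an ideal and $k$ is an arbitrary polynomial function of $z$, then a function
$\alpha$ in $\mathfrak{a}$ may be chosen so that $(\mathfrak{a}, \mathfrak{o}\alpha)$ has no common divisor with $k$.²⁴
Namely, if
$\mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_n]$,
$\mathfrak{o} = [\omega_1, \omega_2, \ldots, \omega_n]$,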
²⁴ The fact that this theorem can be proved at this early stage distinguishes the theory of
algebraic functions from that of algebraic numbers, and allows the former to proceed significantly
more simply than the latter.
and if $\alpha$ is any function in $\mathfrak{a}$, then polynomial functions $x_{\kappa,\iota}$ may be determined so
that

$\alpha\omega_\iota = \sum_{\kappa} x_{\kappa,\iota}\,\alpha_\kappa \quad (\iota = 1, 2, \ldots, n)$

and (by §6, 4)

$(\mathfrak{a}, \mathfrak{o}\alpha) = \mathrm{const.} \sum \pm x_{1,1} x_{2,2} \cdots x_{n,n}$.


Now if $\sum \pm x_{1,1} x_{2,2} \cdots x_{n,n}$ is divisible by a linear factor $z - c$ of $k$, then it is possible
to determine a function $\omega$ in $\mathfrak{o}$, not divisible by $z - c$, and a function $\alpha'$ in $\mathfrak{a}$ so that

$\alpha\omega = (z - c)\alpha'$.²⁵

If one now sets

$t(z - c) - \omega = \omega'$,

where $t$ is an undetermined constant, then

$N(\omega') = t^n (z-c)^n + a_1 t^{n-1}(z-c)^{n-1} + \cdots + a_{n-1}\,t\,(z-c) + a_n$,

where the coefficients $a_1, a_2, \ldots, a_n$ independent of $t$ are polynomial functions of $z$.

It is now impossible for $a_1$ to be divisible by $z - c$, $a_2$ by $(z-c)^2$, ..., and $a_n$
by $(z-c)^n$; otherwise $\frac{\omega}{z-c}$ would be an integral algebraic function (§2, 5; §3, 4),
contrary to hypothesis. Therefore not all terms of $N(\omega')$ are divisible by $(z-c)^n$,
and if $(z-c)^{n-r}$ is the highest power of $z - c$ dividing all of them, then $r > 0$ and

$\frac{N(\omega')}{(z-c)^{n-r}} = t^n (z-c)^r + b_1 t^{n-1} + \cdots + b_{n-1} t + b_n = f(t)$,

where the polynomial functions $b_1, b_2, \ldots, b_n$ do not all vanish for $z = c$.
Consequently there are only a finite number of constant values $t$ for which $f(t)$ is divisible
by $z - c$.

If $z - c'$ is a linear factor of $k$ different from $z - c$, then $f(t)$ is also divisible
by $z - c'$ for only finitely many values of $t$. It follows that one can arrange the
value of $t$ so that $N(\omega')$ is not divisible by $(z-c)^n$ or any other linear factor of $k$.²⁶
When this is done, if one sets

$t\alpha - \alpha' = \alpha''$,
which is likewise a function in $\mathfrak{a}$, then it follows that

$\alpha\omega' = (z - c)\alpha''$,

$N(\alpha'') = \frac{N(\alpha)N(\omega')}{(z-c)^n}$



²⁵ Namely, if the determinant $\sum \pm x_{1,1} x_{2,2} \cdots x_{n,n}$ is divisible by $z - c$, i.e., if it vanishes
for $z = c$, then one can determine a system of constants $c_1, c_2, \ldots, c_n$, not all zero, so that the
polynomial functions

$c_1 x_{\kappa,1} + c_2 x_{\kappa,2} + \cdots + c_n x_{\kappa,n} \quad (\kappa = 1, 2, \ldots, n)$

vanish for $z = c$, and hence are divisible by $z - c$, and one then sets

$\omega = c_1\omega_1 + c_2\omega_2 + \cdots + c_n\omega_n$.


²⁶ This conclusion is not valid for the analogous question in number theory.
and therefore, since

$(\mathfrak{a}, \mathfrak{o}\alpha) = \mathrm{const.}\,\frac{N(\alpha)}{N(\mathfrak{a})}$

by §7, (4), we have:

$(\mathfrak{a}, \mathfrak{o}\alpha'') = \mathrm{const.}\,\frac{(\mathfrak{a}, \mathfrak{o}\alpha)\,N(\omega')}{(z-c)^n}$.

Thus the function $(\mathfrak{a}, \mathfrak{o}\alpha'')$ contains the factor $z - c$ at least once less often than
does $(\mathfrak{a}, \mathfrak{o}\alpha)$, which at the same time can contain no other linear factor of $k$ more
often than does $(\mathfrak{a}, \mathfrak{o}\alpha)$. The theorem therefore follows by repeated application of
this process.


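2. Each ideal $\mathfrak{a}$ can be expressed as the greatest common divisor of two principal
ideals $\mathfrak{o}\mu, \mathfrak{o}\nu$, one of which can be taken as an arbitrary ideal divisible by $\mathfrak{a}$.

Proof. One chooses an arbitrary nonzero function $\nu$ in $\mathfrak{a}$ and (by 1) a second
function $\mu$ such that the two functions $(\mathfrak{a}, \mathfrak{o}\nu)$ and $(\mathfrak{a}, \mathfrak{o}\mu)$ have no common divisor.
Now if $\alpha$ is any function in $\mathfrak{a}$ then it follows by §6 that $(\mathfrak{a}, \mathfrak{o}\mu)\alpha$ is in $\mathfrak{o}\mu$ and $(\mathfrak{a}, \mathfrak{o}\nu)\alpha$
is in $\mathfrak{o}\nu$, so there are two functions $\omega, \omega'$ in $\mathfrak{o}$ for which

$(\mathfrak{a}, \mathfrak{o}\mu)\alpha = \mu\omega, \quad (\mathfrak{a}, \mathfrak{o}\nu)\alpha = \nu\omega'$.

Hence if one chooses two polynomial functions $g, h$ of $z$ satisfying

$g\,(\mathfrak{a}, \mathfrak{o}\mu) + h\,(\mathfrak{a}, \mathfrak{o}\nu) = 1$,

as is possible by the hypothesis on $(\mathfrak{a}, \mathfrak{o}\mu)$ and $(\mathfrak{a}, \mathfrak{o}\nu)$, then it follows that

$\alpha = g\mu\omega + h\nu\omega'$,

i.e., $\alpha$ is divisible by the greatest common divisor of $\mathfrak{o}\mu$ and $\mathfrak{o}\nu$. And since the latter,
conversely, is divisible by $\mathfrak{a}$ (since $\mathfrak{o}\mu$ and $\mathfrak{o}\nu$ are divisible by $\mathfrak{a}$), it is equal to $\mathfrak{a}$.
Q.E.D.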


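3. Each ideal $\mathfrak{a}$ can be converted into a principal ideal $\mathfrak{o}\mu = \mathfrak{a}\mathfrak{m}$ by multiplying
by an ideal $\mathfrak{m}$.²⁷

Proof. By 1, one chooses a function $\mu$ in $\mathfrak{a}$ such that $(\mathfrak{a}, \mathfrak{o}\mu)$ has no divisor in
common with $N(\mathfrak{a})$, and then a second function $\nu$ such that $(\mathfrak{a}, \mathfrak{o}\nu)$ has no divisor
in common with $(\mathfrak{a}, \mathfrak{o}\mu)$. Then, by 2, $\mathfrak{a}$ is the greatest common divisor of $\mathfrak{o}\mu$ and
$\mathfrak{o}\nu$. By §8, 13, the least common multiple of $\mathfrak{o}\mu$ and $\mathfrak{o}\nu$ is of the form $\mathfrak{m}\nu$, where $\mathfrak{m}$
is a divisor of $\mathfrak{o}\mu$. Then by §8, 14

$N(\mathfrak{m}) = \frac{N(\mathfrak{o}\mu)}{N(\mathfrak{a})} = (\mathfrak{a}, \mathfrak{o}\mu)$,

hence, by hypothesis, it has no divisor in common with $N(\mathfrak{a})$. Thus if one again
determines two polynomial functions $g, h$ of $z$ so that

$g\,N(\mathfrak{m}) + h\,N(\mathfrak{a}) = 1$,

then it follows by §8, 5, since $N(\mathfrak{m})$ is in $\mathfrak{m}$ and $N(\mathfrak{a})$ is in $\mathfrak{a}$, that $\mathfrak{m}$ and $\mathfrak{a}$ are
relatively prime ideals. Hence, by §8, 16,

$N(\mathfrak{m}\mathfrak{a}) = N(\mathfrak{m})N(\mathfrak{a}) = N(\mathfrak{o}\mu)$.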
²⁷ One can choose the ideal $\mathfrak{m}$ so that it is also relatively prime to an arbitrary ideal $\mathfrak{b}$. This is
achieved by taking the function $\mu$ so that $(\mathfrak{a}, \mathfrak{o}\mu) = N(\mathfrak{m})$ has no divisor in common with $N(\mathfrak{a})N(\mathfrak{b})$
(§8, 8).
Now since $\mathfrak{o}\mu$ is divisible by $\mathfrak{m}$ and $\mathfrak{a}$, and hence also by $\mathfrak{m}\mathfrak{a}$ (§8, 16), it follows by
§8, 9 that

$\mathfrak{m}\mathfrak{a} = \mathfrak{o}\mu$.
Q.E.D.


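4. If an ideal $\mathfrak{c}$ is divisible by an ideal $\mathfrak{a}$, then there is one and only one ideal $\mathfrak{b}$
that satisfies the condition

$\mathfrak{a}\mathfrak{b} = \mathfrak{c}$.

It is called the quotient of $\mathfrak{c}$ by $\mathfrak{a}$.²⁸
If $\mathfrak{a}\mathfrak{b}$ is divisible by $\mathfrak{a}\mathfrak{b}'$ then $\mathfrak{b}$ is divisible by $\mathfrak{b}'$, and $\mathfrak{a}\mathfrak{b} = \mathfrak{a}\mathfrak{b}'$ implies $\mathfrak{b} = \mathfrak{b}'$.
Proof. Suppose $\mathfrak{c}$ is divisible by $\mathfrak{a}$ and suppose (by 3) that $\mathfrak{a}\mathfrak{m} = \mathfrak{o}\mu$. Then $\mathfrak{c}\mathfrak{m}$ is
also divisible by $\mathfrak{a}\mathfrak{m} = \mathfrak{o}\mu$ and consequently $\mathfrak{c}\mathfrak{m} = \mathfrak{b}\mu$ (§8, 10, 11). Thus, multiplying
the last equation by $\mathfrak{a}$,

$\mathfrak{c}\mu = \mathfrak{a}\mathfrak{b}\mu$,

and by §8, 12

$\mathfrak{c} = \mathfrak{a}\mathfrak{b}$,

proving the first part of the theorem.²⁹
If also $\mathfrak{a}\mathfrak{b}$ is divisible by $\mathfrak{a}\mathfrak{b}'$ then (§8, 10) $\mu\mathfrak{b}$ is divisible by $\mu\mathfrak{b}'$, hence $\mathfrak{b}$ is
divisible by $\mathfrak{b}'$. If $\mathfrak{a}\mathfrak{b} = \mathfrak{a}\mathfrak{b}'$ then it follows that $\mu\mathfrak{b} = \mu\mathfrak{b}'$, and consequently $\mathfrak{b} = \mathfrak{b}'$.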


5. Each ideal different from o is either a prime ideal, or else uniquely expressible 
as a product of prime ideals. 

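Proof. If the ideal $\mathfrak{a}$ is different from $\mathfrak{o}$ then (by §8, 15) it is divisible by a prime
ideal $\mathfrak{p}_1$; and hence (by 4) $\mathfrak{a} = \mathfrak{p}_1\mathfrak{a}_1$, where $\mathfrak{a}_1$ is a strict divisor of $\mathfrak{a}$ (because $\mathfrak{a}_1 = \mathfrak{a}$
implies $\mathfrak{p}_1 = \mathfrak{o}$ by 4). Thus the degree of $N(\mathfrak{a}_1)$ is lower than that of $N(\mathfrak{a})$. If $\mathfrak{a}_1$ is
different from $\mathfrak{o}$, then one concludes similarly that $\mathfrak{a}_1 = \mathfrak{p}_2\mathfrak{a}_2$, where the degree of
$N(\mathfrak{a}_2)$ is lower than that of $N(\mathfrak{a}_1)$. Continuing in this way, after a finite number of
steps one arrives at an ideal $\mathfrak{a}_{r-1} = \mathfrak{p}_r\mathfrak{a}_r$ with $N(\mathfrak{a}_r) = 1$, so $\mathfrak{a}_r = \mathfrak{o}$. Therefore

$\mathfrak{a} = \mathfrak{p}_1\mathfrak{p}_2\cdots\mathfrak{p}_r$.

If there were a second such decomposition, say

$\mathfrak{p}_1\mathfrak{p}_2\cdots\mathfrak{p}_r = \mathfrak{q}_1\mathfrak{q}_2\cdots\mathfrak{q}_s$,

then (by §8, 18) at least one of the prime ideals $\mathfrak{p}_1, \mathfrak{p}_2, \ldots, \mathfrak{p}_r$, say $\mathfrak{p}_1$, must be
divisible by $\mathfrak{q}_1$ and hence equal to $\mathfrak{q}_1$. Then, by 4,

$\mathfrak{p}_2\mathfrak{p}_3\cdots\mathfrak{p}_r = \mathfrak{q}_2\mathfrak{q}_3\cdots\mathfrak{q}_s$.

From this one concludes similarly that $\mathfrak{p}_2 = \mathfrak{q}_2$, etc.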

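Collecting equal primes in the decomposition just obtained, one can set

$\mathfrak{a} = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\cdots\mathfrak{p}_r^{e_r}$.

Then a divisor $\mathfrak{a}_1$ of $\mathfrak{a}$ can be divisible by no prime ideals except $\mathfrak{p}_1, \mathfrak{p}_2, \ldots, \mathfrak{p}_r$,
and no more often than $\mathfrak{a}$ is. Thus one obtains all the divisors of $\mathfrak{a}$, the number of
which is finite and equal to $(e_1 + 1)(e_2 + 1)\cdots(e_r + 1)$, by letting the exponents $h_s$
in

$\mathfrak{p}_1^{h_1}\mathfrak{p}_2^{h_2}\cdots\mathfrak{p}_r^{h_r}$
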


²⁸ Recall that this was the particular difficulty in algebraic number theory pointed out by
Dedekind (1877). (Translator’s note.)
²⁹ This definition of quotient of two ideals agrees completely with that given in §4, 8.
run through the numbers $0, 1, 2, \ldots, e_s$ (where $\mathfrak{p}^0$ is understood to be the ideal $\mathfrak{o}$).
If $\mathfrak{a}, \mathfrak{b}$ are two ideals

$\mathfrak{a} = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\cdots\mathfrak{p}_r^{e_r}, \quad \mathfrak{b} = \mathfrak{p}_1^{f_1}\mathfrak{p}_2^{f_2}\cdots\mathfrak{p}_r^{f_r}$

(where some of the exponents $e, f$ can be zero), then one obtains the greatest
common divisor and the least common multiple of $\mathfrak{a}$ and $\mathfrak{b}$ in the form

$\mathfrak{p}_1^{g_1}\mathfrak{p}_2^{g_2}\cdots\mathfrak{p}_r^{g_r}$

by taking $g_1, g_2, \ldots, g_r$ to be the least members of the respective pairs $e_1, f_1$; $e_2, f_2$;
...; $e_r, f_r$ in the former case, and the greatest in the latter case.
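The exponent bookkeeping in 5 can be sketched in a few lines of Python. This is only an illustration: an ideal is represented by its list of exponents over a fixed list of prime ideals, and the function names (`gcd_exponents`, `lcm_exponents`, `divisor_count`) are hypothetical, not notation from the text.

```python
from math import prod

def gcd_exponents(e, f):
    """Exponents of the greatest common divisor: the smaller of each pair."""
    return [min(x, y) for x, y in zip(e, f)]

def lcm_exponents(e, f):
    """Exponents of the least common multiple: the larger of each pair."""
    return [max(x, y) for x, y in zip(e, f)]

def divisor_count(e):
    """Number of divisors, (e_1 + 1)(e_2 + 1)...(e_r + 1)."""
    return prod(x + 1 for x in e)

# Hypothetical example: a = p1^2 * p3 and b = p1 * p2^3 * p3.
e = [2, 0, 1]
f = [1, 3, 1]
g = gcd_exponents(e, f)   # exponents of the greatest common divisor
m = lcm_exponents(e, f)   # exponents of the least common multiple
```

Since min(x, y) + max(x, y) = x + y for each pair, the exponents of the product of the two ideals agree with those of the product of their greatest common divisor and least common multiple, which matches the norm relation of §8, 7.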
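6. If $\mathfrak{a}, \mathfrak{b}$ are any two ideals, then

$N(\mathfrak{a}\mathfrak{b}) = N(\mathfrak{a})N(\mathfrak{b})$.

Proof. Suppose, as in 5, that $\mathfrak{a} = \mathfrak{p}_1\mathfrak{a}_1$. Then, since $\mathfrak{a}_1$ is a strict divisor of
$\mathfrak{a}$, there is a function $\eta$ in $\mathfrak{a}_1$ not divisible by $\mathfrak{a}$. The least common multiple and
the greatest common divisor of $\mathfrak{a}$ and $\mathfrak{o}\eta$ are $\mathfrak{p}_1\eta$ and $\mathfrak{a}_1$ respectively, as follows
immediately (by 5) from the decomposition of $\mathfrak{a}$ and $\mathfrak{o}\eta$ into their prime factors.
But then it follows from §8, 14 that

$N(\mathfrak{a}) = N(\mathfrak{p}_1)N(\mathfrak{a}_1)$.

Repeating this argument for $\mathfrak{a}_1$ etc., we get

$N(\mathfrak{a}) = N(\mathfrak{p}_1)N(\mathfrak{p}_2)\cdots N(\mathfrak{p}_r)$,

when $\mathfrak{a} = \mathfrak{p}_1\mathfrak{p}_2\cdots\mathfrak{p}_r$, and this gives

$N(\mathfrak{a}\mathfrak{b}) = N(\mathfrak{a})N(\mathfrak{b})$.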
7. Each prime ideal is an ideal of first degree (§7) and, conversely, each ideal
of first degree is a prime ideal.³⁰
Proof. If $\mathfrak{p}$ is a prime ideal, then $N(\mathfrak{p})$ is divisible by $\mathfrak{p}$, and hence at least one
of the linear factors of $N(\mathfrak{p})$, say $z - c$, is divisible by $\mathfrak{p}$ (§8, 18). If $\omega$ is an arbitrary
function in $\mathfrak{o}$, satisfying the equation:

$\omega^n + a_1\omega^{n-1} + \cdots + a_{n-1}\omega + a_n = 0$,

then, by replacing the polynomial functions $a_1, a_2, \ldots, a_n$ by their constant remainders
$a_1^{(0)}, a_2^{(0)}, \ldots, a_n^{(0)}$ modulo $z - c$, one obtains the integral algebraic function

$\omega^n + a_1^{(0)}\omega^{n-1} + \cdots + a_{n-1}^{(0)}\omega + a_n^{(0)}$,

and decomposing this into linear factors $(\omega - b_1), (\omega - b_2), \ldots, (\omega - b_n)$:

$(\omega - b_1)(\omega - b_2)\cdots(\omega - b_n) = (z - c)\omega' \equiv 0 \pmod{\mathfrak{p}}$.

Thus at least one of the factors $\omega - b_1, \omega - b_2, \ldots$ must be divisible by $\mathfrak{p}$, i.e.,

$\omega \equiv b \pmod{\mathfrak{p}}$,

where $b$ is a constant. Since this means every function in $\mathfrak{o}$ is congruent to a constant
(mod $\mathfrak{p}$), it follows by §6 that $(\mathfrak{o}, \mathfrak{p}) = N(\mathfrak{p}) = z - c$ is a linear function of $z$, and
the first part of the assertion is proved.

Conversely: if $\mathfrak{q}$ is an ideal of first degree, and

$N(\mathfrak{q}) = z - c$

³⁰ This theorem distinguishes the theory of algebraic functions from the analogous theory of
algebraic numbers.
then $\mathfrak{q}$ is certainly divisible by a prime ideal $\mathfrak{p}$, and since $N(\mathfrak{q})$ is divisible by $N(\mathfrak{p})$,
$N(\mathfrak{p}) = N(\mathfrak{q}) = z - c$. Hence, by §8, 9,

$\mathfrak{p} = \mathfrak{q}$.

It follows from this that the degree of an ideal is equal to the number of factors
in its prime decomposition. Therefore if

$\mathfrak{o}(z - c) = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\mathfrak{p}_3^{e_3}\cdots$

then

$e_1 + e_2 + e_3 + \cdots = n$.

It also follows that a polynomial function of $z$ is divisible by a prime ideal $\mathfrak{p}$ if and
only if it is divisible by the norm of $\mathfrak{p}$.
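A small example may make the decomposition of $z - c$ concrete. The field below is chosen purely for illustration and is not one treated in the text: take $\Omega$ generated over the rational functions of $z$ by $\theta$ with $\theta^2 = z$, so $n = 2$. Assuming, as is standard for this field, that $\mathfrak{o}$ consists of the polynomials in $\theta$, every ideal is principal and the prime ideals are the $\mathfrak{o}(\theta - b)$ for constants $b$.

```latex
N(\theta - b) = (\theta - b)(-\theta - b) = b^2 - z = \mathrm{const.}\,(z - b^2),
\\[4pt]
\mathfrak{o}(z - c) =
\begin{cases}
\mathfrak{o}(\theta - b)\cdot\mathfrak{o}(\theta + b), \quad b^2 = c, & \text{if } c \neq 0 \quad (e_1 = e_2 = 1),\\
\bigl(\mathfrak{o}\theta\bigr)^2, & \text{if } c = 0 \quad (e_1 = 2).
\end{cases}
```

Each prime ideal has a norm of first degree, and in both cases the exponents add up to $n = 2$, as the theorem requires.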


§10. Complementary bases of the field 


SUMMARY AND COMMENTS 


For each basis $\alpha_1, \ldots, \alpha_n$ of $\Omega$ there is a basis $\alpha'_1, \ldots, \alpha'_n$,
called the complementary basis. It is defined with the help of the
trace and has discriminant

$\Delta(\alpha'_1, \ldots, \alpha'_n) = 1/\Delta(\alpha_1, \ldots, \alpha_n)$.

When $\alpha_1, \ldots, \alpha_n$ is the basis of a module $\mathfrak{a}$, the complementary basis
is the basis of a complementary module $\mathfrak{a}'$, which is independent
of the basis chosen for $\mathfrak{a}$.

In particular, when $\mathfrak{e}$ is the complementary module of $\mathfrak{o}$, and
$D$ is the discriminant of $\Omega$, then $D\mathfrak{e}$ is an ideal. The module $\mathfrak{e}$ is
related to ramification in §11, and it comes up again in §23, where 
it is shown that if w € o then de € e. Finally, in §32, e is used 
to conclude the proof of the residue theorem. The ideal f, which 
follows on the heels of $\mathfrak{e}$ in this section and the next, also reappears
in §23 as the “ideal of the double points.” 

The general concept of complementary module (more precisely, 
the related concept of supplementary polygon) has a key role to play 
in the proof of the Riemann-Roch theorem, in §27. The concept of 
complementary basis is also involved in the proof of Abel’s theorem, 
§20. 

Complementary modules can be viewed as the module coun- 
terpart of dual vector spaces, and they are introduced as such in 
Eichler (1966), where their role is highlighted in a “Riemann-Roch 
theorem for linear divisors” on p. 24. However, it is still a long road 
to the classical Riemann-Roch theorem. For Dedekind and Weber 
the crucial extra ingredient is the concept of differential, introduced 
in §23. Already in subsection 8 of this section one sees a relationship 
between the concepts of complementary basis and the derivative of 
a polynomial.

1. Definition. If the functions $\alpha_1, \alpha_2, \ldots, \alpha_n$ form a basis of $\Omega$, and if one
makes the abbreviations

$\mathrm{Tr}(\alpha_r\alpha_s) = a_{r,s} = a_{s,r}$,

$\Delta(\alpha_1, \alpha_2, \ldots, \alpha_n) = \sum \pm a_{1,1} a_{2,2} \cdots a_{n,n} = a$ (§2),

then, since $a$ is nonzero, it is possible to determine a system of functions $\alpha'_1, \alpha'_2, \ldots, \alpha'_n$
by the linear equations

(1) $\alpha_r = \sum_{\iota=1}^{n} a_{r,\iota}\,\alpha'_\iota$,

and since

$\Delta(\alpha'_1, \alpha'_2, \ldots, \alpha'_n) = \frac{1}{a}$

is nonzero, the functions $\alpha'_1, \alpha'_2, \ldots, \alpha'_n$ likewise form a basis of $\Omega$. It will be called
the complementary basis of $\alpha_1, \alpha_2, \ldots, \alpha_n$.


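2. If $\delta_{r,s}$ denotes the number 1 or 0 according as $r, s$ are equal or not,³¹ where
the indices $r, s$ belong to the series $1, 2, \ldots, n$, then

(2) $\mathrm{Tr}(\alpha_r\alpha'_s) = \delta_{r,s}$,

because it follows by solution of the equations (1) that

$\alpha'_s = \sum_\iota a'_{\iota,s}\,\alpha_\iota, \quad \sum_\iota a_{r,\iota}\,a'_{\iota,s} = \delta_{r,s}$,

whence:

$\alpha_r\alpha'_s = \sum_\iota a'_{\iota,s}\,\alpha_r\alpha_\iota; \quad \mathrm{Tr}(\alpha_r\alpha'_s) = \sum_\iota a'_{\iota,s}\,a_{\iota,r} = \delta_{r,s}$.

Conversely, if a function system $\beta_s$ satisfies the conditions $\mathrm{Tr}(\alpha_r\beta_s) = \delta_{r,s}$, then
$\beta_s = \alpha'_s$. Because, if one sets $\beta_s = \sum_\iota b_{\iota,s}\,\alpha'_\iota$, then it follows from (2) that

$b_{r,s} = \mathrm{Tr}(\beta_s\alpha_r) = \delta_{r,s}$.

This implies that the relation between the $\alpha_r$ and the $\alpha'_r$ is symmetric, i.e., that
the basis $\alpha_1, \alpha_2, \ldots, \alpha_n$ is complementary to $\alpha'_1, \alpha'_2, \ldots, \alpha'_n$.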
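3. If $\eta$ is an arbitrary function in $\Omega$, then one can always set

$\eta = \sum_\iota x_\iota\alpha_\iota = \sum_\iota x'_\iota\alpha'_\iota$,

and it follows by application of (2) that

$x_\iota = \mathrm{Tr}(\eta\alpha'_\iota), \quad x'_\iota = \mathrm{Tr}(\eta\alpha_\iota)$,

hence:

(3) $\eta = \sum_\iota \alpha_\iota\,\mathrm{Tr}(\eta\alpha'_\iota) = \sum_\iota \alpha'_\iota\,\mathrm{Tr}(\eta\alpha_\iota)$.


4. If $\eta$ is any nonzero function in $\Omega$, then

$\frac{\alpha'_1}{\eta}, \frac{\alpha'_2}{\eta}, \ldots, \frac{\alpha'_n}{\eta}$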
³¹ Dedekind and Weber denote this number by $(r, s)$. But since this notation is already
overused, I have changed it to the now-familiar Kronecker delta notation. (Translator’s note.)
is the basis complementary to $\eta\alpha_1, \eta\alpha_2, \ldots, \eta\alpha_n$. This follows from 2 because

$\mathrm{Tr}\left(\eta\alpha_r \cdot \frac{\alpha'_s}{\eta}\right) = \mathrm{Tr}(\alpha_r\alpha'_s) = \delta_{r,s}$.
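5. When two bases of $\Omega$, $\alpha_1, \alpha_2, \ldots, \alpha_n$ and $\beta_1, \beta_2, \ldots, \beta_n$, are connected by
the $n$ equations

$\beta_s = \sum_\iota x_{\iota,s}\,\alpha_\iota$

with rational coefficients $x_{\iota,s}$, their complementary bases are connected by the $n$
equations

$\alpha'_r = \sum_\iota x_{r,\iota}\,\beta'_\iota$

(transposed substitution). This is an immediate consequence of 3 because

$x_{r,s} = \mathrm{Tr}(\alpha'_r\beta_s)$.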
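6. One has

$\sum_\iota \alpha_\iota\alpha'_\iota = 1$,

hence:

$\sum_{\iota,\iota'} a_{\iota,\iota'}\,\alpha'_\iota\alpha'_{\iota'} = \sum_{\iota,\iota'} a'_{\iota,\iota'}\,\alpha_\iota\alpha_{\iota'} = 1$.

Namely, if one first sets

$\sum_\iota \alpha_\iota\alpha'_\iota = \sigma$,

then it follows from 3 (applied to the functions $\eta\alpha_r$) that

$\eta\alpha_r = \sum_\iota \alpha_\iota\,\mathrm{Tr}(\eta\alpha_r\alpha'_\iota)$.

Hence, by definition of the trace in §2, (5),

$\mathrm{Tr}(\eta) = \sum_\iota \mathrm{Tr}(\eta\alpha_\iota\alpha'_\iota) = \mathrm{Tr}\left(\eta\sum_\iota \alpha_\iota\alpha'_\iota\right)$;

so

$\mathrm{Tr}(\eta\sigma) = \mathrm{Tr}(\eta)$,

and when one substitutes $\eta = \sigma$ and then $\eta = 1$ in 3,

$\sigma = \sum_\iota \alpha_\iota\,\mathrm{Tr}(\sigma\alpha'_\iota) = \sum_\iota \alpha_\iota\,\mathrm{Tr}(\alpha'_\iota) = 1$.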


We now look in more detail at the construction of the complementary bases in 
two special cases: 


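7. Let $\omega_1, \omega_2, \ldots, \omega_n$ be a basis of $\mathfrak{o}$ and let $\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$ be the complementary
basis (of $\Omega$). Let

$e_{r,s} = e_{s,r} = \mathrm{Tr}(\omega_r\omega_s)$,

which are polynomial functions, and let

$D = \mathrm{const.} \sum \pm e_{1,1} e_{2,2} \cdots e_{n,n}$

be the discriminant of $\Omega$. Then by 2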




whence it follows that the functions $D\varepsilon_s$ are integral algebraic functions. But it
also follows from 6 that 


OD 
P= 2 ea 
whence 
1 OD OAD 
Er€s D2 Do Ben, Oe, ; Des, ¥ Do ww! 


_1 ap fo ga dea te ea) 
+ pe Oey, 0€ 5,1! Oey, Oey! oo 


and then it follows, by a well-known determinant theorem, that


1 OD 
Fay Dens e D2 ieee i te u Deo 


Ep€sg = 


which yields the important result that the functions $D\varepsilon_r\varepsilon_s$ are also integral algebraic
functions.


8. Let $\theta$ be a function in $\Omega$ such that $1, \theta, \theta^2, \ldots, \theta^{n-1}$ form a basis of $\Omega$, and
suppose

(4) $f(\theta) = \theta^n + a_1\theta^{n-1} + \cdots + a_{n-1}\theta + a_n = 0$

is irreducible with rational coefficients $a$. We seek the complementary basis to
$1, \theta, \theta^2, \ldots, \theta^{n-1}$. For an undetermined constant $t$ we set

$\frac{f(t)}{t - \theta} = \eta_0 + \eta_1 t + \eta_2 t^2 + \cdots + \eta_{n-1} t^{n-1}$,

hence

(5)
$\eta_0 = a_{n-1} + a_{n-2}\theta + \cdots + a_1\theta^{n-2} + \theta^{n-1}$,
$\eta_1 = a_{n-2} + a_{n-3}\theta + \cdots + \theta^{n-2}$,
$\cdots$
$\eta_{n-2} = a_1 + \theta$,
$\eta_{n-1} = 1$,

and so the functions $\eta_0, \eta_1, \ldots, \eta_{n-1}$ likewise form a basis of $\Omega$, since the determinant
of the equations (5) equals $(-1)^{\frac{1}{2}n(n-1)}$, and hence is nonzero. Each function
$\zeta$ may therefore be expressed in the form

$\zeta = y_0\eta_0 + y_1\eta_1 + \cdots + y_{n-1}\eta_{n-1}$.


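We now continue the series of rational functions $y_0, y_1, \ldots, y_{n-1}$ by determining
functions $y_n, y_{n+1}, \ldots$ by the recursion

(6) $a_n y_r + a_{n-1} y_{r+1} + \cdots + a_2 y_{r+n-2} + a_1 y_{r+n-1} + y_{r+n} = 0$.

Now, by (5)

(7)
$\theta\eta_0 = -a_n\eta_{n-1}$,
$\theta\eta_1 = \eta_0 - a_{n-1}\eta_{n-1}$,
$\theta\eta_2 = \eta_1 - a_{n-2}\eta_{n-1}$,
$\cdots$
$\theta\eta_{n-1} = \eta_{n-2} - a_1\eta_{n-1}$;

hence

$\zeta\theta = y_1\eta_0 + y_2\eta_1 + \cdots + y_{n-1}\eta_{n-2} + y_n\eta_{n-1}$;

and similarly for each positive integer $r$:

$\zeta\theta^r = y_r\eta_0 + y_{r+1}\eta_1 + \cdots + y_{r+n-2}\eta_{n-2} + y_{r+n-1}\eta_{n-1}$,

or, when one expresses $\eta_0, \eta_1, \ldots, \eta_{n-1}$ in terms of $1, \theta, \theta^2, \ldots, \theta^{n-1}$:

$\zeta\theta^r = x_0^{(r)} + x_1^{(r)}\theta + x_2^{(r)}\theta^2 + \cdots + x_{n-1}^{(r)}\theta^{n-1}$,

where

$x_0^{(r)} = y_r a_{n-1} + y_{r+1} a_{n-2} + \cdots + y_{r+n-2} a_1 + y_{r+n-1}$,
$x_1^{(r)} = y_r a_{n-2} + y_{r+1} a_{n-3} + \cdots + y_{r+n-2}$,
$\cdots$
$x_{n-2}^{(r)} = y_r a_1 + y_{r+1}$,
$x_{n-1}^{(r)} = y_r$.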
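Hence (by the definition of Tr, §2, (5)),

$\mathrm{Tr}(\zeta) = x_0^{(0)} + x_1^{(1)} + x_2^{(2)} + \cdots + x_{n-1}^{(n-1)}$
$= y_0 a_{n-1} + 2y_1 a_{n-2} + \cdots + (n-1)\,y_{n-2} a_1 + n\,y_{n-1}$.

Thus, applying this to $\zeta = \eta_r$:

$\mathrm{Tr}(\eta_r) = (r+1)\,a_{n-1-r}; \quad \mathrm{Tr}(\eta_{n-1-r}) = (n-r)\,a_r$,

where $a_0$ is set equal to 1.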
Hence if one makes the abbreviation³²

$\mathrm{Tr}(\theta^r) = s_r$,

then it follows from (5), for $r < n$, that

(8) $(n - r)\,a_r = a_r s_0 + a_{r-1} s_1 + \cdots + a_1 s_{r-1} + s_r$

and from (4) that in general

(9) $0 = a_n s_r + a_{n-1} s_{r+1} + \cdots + a_1 s_{r+n-1} + s_{r+n}$.


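But these formulae also imply

(10)
$f'(\theta) = n\theta^{n-1} + (n-1)\,a_1\theta^{n-2} + \cdots + 2a_{n-2}\theta + a_{n-1}$
$= s_0\eta_0 + s_1\eta_1 + \cdots + s_{n-1}\eta_{n-1}$,
$\theta^r f'(\theta) = s_r\eta_0 + s_{r+1}\eta_1 + \cdots + s_{r+n-2}\eta_{n-2} + s_{r+n-1}\eta_{n-1}$.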


Bearing in mind the value of the determinant for the equation system (5), and the 
definitions of norm and discriminant in §2, (4) and (12), we now get the important 


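³² Remember that Dedekind and Weber use $S$ for trace (= “Spur” in German); this accounts
for the notational choice of lower case $s$. (Translator’s note.)
formula

(11) $N f'(\theta) = (-1)^{\frac{1}{2}n(n-1)} \begin{vmatrix} s_0 & s_1 & \cdots & s_{n-1} \\ s_1 & s_2 & \cdots & s_n \\ \vdots & \vdots & & \vdots \\ s_{n-1} & s_n & \cdots & s_{2n-2} \end{vmatrix} = (-1)^{\frac{1}{2}n(n-1)}\,\Delta(1, \theta, \theta^2, \ldots, \theta^{n-1})$.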
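Formulas (8), (9) and (11) can be checked numerically on a toy case. The sketch below makes a simplifying assumption that is not in the text: the coefficients $a_1, \ldots, a_n$ are constants and the polynomial has known distinct roots, so that $\mathrm{Tr}$ is read as the sum over the roots and $N$ as the product over the roots. The helper names `power_sums` and `det` are hypothetical.

```python
from fractions import Fraction

def power_sums(a, count):
    """Return s_0..s_{count-1} for t^n + a[0] t^(n-1) + ... + a[n-1].

    Uses (8) for r <= n and (9) for r > n, solved for s_r.
    """
    n = len(a)
    s = [Fraction(n)]  # s_0 = Tr(1) = n
    for r in range(1, count):
        total = sum(a[k - 1] * s[r - k] for k in range(1, min(r - 1, n) + 1))
        if r <= n:
            total += r * a[r - 1]
        s.append(-Fraction(total))
    return s

def det(matrix):
    """Determinant by exact Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in matrix]
    size = len(m)
    result = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return result

# Toy example (an assumption for illustration): f(t) = (t-1)(t-2)(t-3)
#                                                    = t^3 - 6t^2 + 11t - 6
a = [-6, 11, -6]
n = len(a)
s = power_sums(a, 2 * n - 1)
hankel = [[s[i + j] for j in range(n)] for i in range(n)]
discriminant = det(hankel)            # determinant of the s_{i+j}
sign = (-1) ** (n * (n - 1) // 2)
norm_f_prime = 1
for root in (1, 2, 3):                # product of f'(root) over the roots
    norm_f_prime *= 3 * root**2 - 12 * root + 11
```

In this toy case the power sums agree with the direct sums of powers of 1, 2, 3, and the product of the values of $f'$ at the roots equals $(-1)^{\frac{1}{2}n(n-1)}$ times the determinant of the $s_{i+j}$, as (11) predicts.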


Looking back to Definition 1, we see that the equations (10) also yield the
complementary basis of $1, \theta, \theta^2, \ldots, \theta^{n-1}$:

$\frac{\eta_0}{f'(\theta)}, \frac{\eta_1}{f'(\theta)}, \ldots, \frac{\eta_{n-1}}{f'(\theta)}$.
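As a hedged illustration of this recipe, consider again a field chosen only for the example: $\theta^2 = z$, $n = 2$, so $f(t) = t^2 - z$ with $a_1 = 0$, $a_2 = -z$, and $\mathrm{Tr}(u + v\theta) = 2u$ for rational functions $u, v$ of $z$.

```latex
\eta_0 = a_1 + \theta = \theta, \qquad \eta_1 = 1, \qquad f'(\theta) = 2\theta, \\
\frac{\eta_0}{f'(\theta)} = \frac{1}{2}, \qquad
\frac{\eta_1}{f'(\theta)} = \frac{1}{2\theta} = \frac{\theta}{2z}, \\
\mathrm{Tr}\Bigl(1\cdot\tfrac{1}{2}\Bigr) = 1, \quad
\mathrm{Tr}\Bigl(\theta\cdot\tfrac{1}{2\theta}\Bigr) = 1, \quad
\mathrm{Tr}\Bigl(1\cdot\tfrac{\theta}{2z}\Bigr) = 0, \quad
\mathrm{Tr}\Bigl(\theta\cdot\tfrac{1}{2}\Bigr) = 0.
```

So $\tfrac{1}{2}, \tfrac{1}{2\theta}$ satisfy the defining relations (2) for the basis complementary to $1, \theta$. Formula (11) can be checked in the same example: $s_0 = 2$, $s_1 = 0$, $s_2 = 2z$ give the determinant $4z$, while $N f'(\theta) = N(2\theta) = -4z$, in agreement with the sign $(-1)^{\frac{1}{2}n(n-1)} = -1$.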
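9. If $\mathfrak{a} = [\alpha_1, \alpha_2, \ldots, \alpha_n]$ denotes a module whose basis is also a basis of $\Omega$,
then the basis $\alpha'_1, \alpha'_2, \ldots, \alpha'_n$ of $\Omega$ complementary to $\alpha_1, \alpha_2, \ldots, \alpha_n$ gives another
module $\mathfrak{a}' = [\alpha'_1, \alpha'_2, \ldots, \alpha'_n]$ called the complementary module of $\mathfrak{a}$. The latter, as
follows immediately from 5, together with §4, 2, is independent of the choice of
basis for $\mathfrak{a}$.


10. We consider in particular the module $\mathfrak{e} = [\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n]$ complementary to
$\mathfrak{o} = [\omega_1, \omega_2, \ldots, \omega_n]$. If we set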
ws = > ewe, 
L 
then by 3 


ee = Tr(wrwsé.) 


€ 
is a polynomial function of z, and it follows that 


Wye = ee,, 


It follows from this that the module $\mathfrak{o}\mathfrak{e}$ (§4, 7) is divisible by $\mathfrak{e}$. On the other hand,
since $\mathfrak{o}$ contains the function 1, $\mathfrak{e}$ is divisible by $\mathfrak{o}\mathfrak{e}$, hence

$\mathfrak{o}\mathfrak{e} = \mathfrak{e}$.

I.e., the module $\mathfrak{e}$, although it is not composed completely of integral algebraic
functions, has the characteristic property I, §7, of ideals. Consequently, the same
is true of the module $\mathfrak{e}^2$. Since the two modules $D\mathfrak{e}, D\mathfrak{e}^2$ contain only integral
algebraic functions, by 7, they are both ideals, and it follows from 7 that

$N(D\mathfrak{e}) = D^{n-1}$.


11. If $\theta$ is a function in $\mathfrak{o}$ with the property that $1, \theta, \theta^2, \ldots, \theta^{n-1}$ forms a basis
of $\Omega$, so that the coefficients in the irreducible equation

$f(\theta) = \theta^n + a_1\theta^{n-1} + \cdots + a_{n-1}\theta + a_n = 0$

are polynomial functions of $z$, then for $r = 0, 1, 2, \ldots, n-1$ one can determine the
polynomial functions $k_\iota^{(r)}$ such that

$\theta^r = \sum_{\iota=1}^{n} k_\iota^{(r)}\,\omega_\iota$.

Applying Theorems 5 and 8 to this, one gets

$f'(\theta)\,\varepsilon_s = k_s^{(0)}\eta_0 + k_s^{(1)}\eta_1 + \cdots + k_s^{(n-1)}\eta_{n-1}$,

which implies that the module

$f'(\theta)\,\mathfrak{e} = \mathfrak{f}$

contains only integral algebraic functions. One concludes from 10 that the latter is
an ideal.


§11. The ramification ideal 


SUMMARY AND COMMENTS 


The ramification ideal, as its name suggests, is associated with
the ramification points of a Riemann surface. Unfortunately, “points”
will not be introduced until §14, and “ramification points” only in
§16, so the meaning of ramification ideal will probably be unclear
until we arrive at §16.

Nevertheless, a glimpse of ramification may be caught near the 
end of subsection 2, where it is mentioned that “there are only 
a finite number of linear functions z — c divisible by the square 
of a prime ideal.” These finitely many values of c
correspond to ramification points. As mentioned in Section 4 of the
Translator’s Introduction, ramification is crucial for the definition of 
genus in the Dedekind-Weber concept of Riemann surface. 

In this section another important property of the ramification
ideal comes to light: its norm $N(\mathfrak{z})$ is the discriminant $D$ of the
field $\Omega$. In this way $D$ acquires added meaning as the norm of an
ideal.

If one recalls the classical concept of discriminant, as a function
of polynomials of degree $n$ that vanishes precisely for polynomials
with fewer than $n$ distinct roots, then one can expect a connection
between the discriminant and ramification. Assuming that $\Omega$ somehow
gives rise to a “Riemann surface” of $n$ sheets, the discriminant
ought to vanish where the $n$ sheets are not distinct, that is, at
ramification points.

Continuing this train of thought, we might expect the degree 
of the discriminant to give the number of ramification points. This 
is shown to be the case in §16. 


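1. Lemma. If any two of the ideals $\mathfrak{a}, \mathfrak{b}, \mathfrak{c}, \ldots$ are relatively prime, then there is
a function which, modulo each of them, is congruent to a given function in $\mathfrak{o}$.³³

Proof. One sets

$\mathfrak{m} = \mathfrak{a}\mathfrak{b}\mathfrak{c}\cdots = \mathfrak{a}\mathfrak{a}_1 = \mathfrak{b}\mathfrak{b}_1 = \mathfrak{c}\mathfrak{c}_1 = \cdots$.

³³ A form of the result generally known as the Chinese Remainder Theorem. (Translator’s
note.)
The greatest common divisor of $\mathfrak{a}_1 = \mathfrak{b}\mathfrak{c}\cdots$, $\mathfrak{b}_1 = \mathfrak{a}\mathfrak{c}\cdots$, $\mathfrak{c}_1 = \mathfrak{a}\mathfrak{b}\cdots$ is then equal
to $\mathfrak{o}$, since no prime ideal can simultaneously divide $\mathfrak{a}_1, \mathfrak{b}_1, \mathfrak{c}_1, \ldots$. Hence (§4, 5)
one can choose $\alpha_1$ from $\mathfrak{a}_1$, $\beta_1$ from $\mathfrak{b}_1$, $\gamma_1$ from $\mathfrak{c}_1$, ... so that

$\alpha_1 + \beta_1 + \gamma_1 + \cdots = 1$,

hence

$\alpha_1 \equiv 1, \quad \beta_1 \equiv 0, \quad \gamma_1 \equiv 0, \ldots \pmod{\mathfrak{a}}$,
$\alpha_1 \equiv 0, \quad \beta_1 \equiv 1, \quad \gamma_1 \equiv 0, \ldots \pmod{\mathfrak{b}}$,
$\alpha_1 \equiv 0, \quad \beta_1 \equiv 0, \quad \gamma_1 \equiv 1, \ldots \pmod{\mathfrak{c}}$.

Therefore, if $\lambda, \mu, \nu, \ldots$ are given functions in $\mathfrak{o}$,

$\omega \equiv \lambda\alpha_1 + \mu\beta_1 + \nu\gamma_1 + \cdots \pmod{\mathfrak{m}}$

satisfies the conditions

$\omega \equiv \lambda \pmod{\mathfrak{a}}, \quad \omega \equiv \mu \pmod{\mathfrak{b}}, \quad \omega \equiv \nu \pmod{\mathfrak{c}}, \ldots$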
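The construction in this proof can be followed in a minimal case. As an assumption made only for illustration, take $n = 1$, so that $\mathfrak{o}$ is the ring of polynomials in $z$, and take the two relatively prime ideals $\mathfrak{a} = \mathfrak{o}z$ and $\mathfrak{b} = \mathfrak{o}(z - 1)$, so that $\mathfrak{a}_1 = \mathfrak{b}$ and $\mathfrak{b}_1 = \mathfrak{a}$.

```latex
\alpha_1 = 1 - z \in \mathfrak{a}_1, \qquad \beta_1 = z \in \mathfrak{b}_1, \qquad \alpha_1 + \beta_1 = 1, \\
\lambda = 2, \quad \mu = 5: \qquad \omega = \lambda\alpha_1 + \mu\beta_1 = 2(1 - z) + 5z = 2 + 3z, \\
\omega \equiv 2 \pmod{\mathfrak{a}}, \qquad \omega \equiv 5 \pmod{\mathfrak{b}}.
```

Here the congruences simply say that $\omega$ takes the value 2 at $z = 0$ and the value 5 at $z = 1$, so the lemma reduces to interpolation.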
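2. Let $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$ be the distinct prime ideals dividing an arbitrary linear
function $z - c$, and let

$\mathfrak{o}(z - c) = \mathfrak{p}^{e}\mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\cdots, \quad e + e_1 + e_2 + \cdots = n$ (§9, 7).

One chooses functions $\lambda, \lambda_1, \lambda_2, \ldots$ divisible respectively by $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$ but not by
$\mathfrak{p}^2, \mathfrak{p}_1^2, \mathfrak{p}_2^2, \ldots$ and lets $b, b_1, b_2, \ldots$ be arbitrary but distinct constants. Then by 1 it
is possible to determine a function $\zeta$ satisfying the congruences

$\zeta \equiv b + \lambda \pmod{\mathfrak{p}^2}, \quad \zeta \equiv b_1 + \lambda_1 \pmod{\mathfrak{p}_1^2}, \quad \zeta \equiv b_2 + \lambda_2 \pmod{\mathfrak{p}_2^2}, \ldots$,

hence

$\zeta \equiv b \pmod{\mathfrak{p}}, \quad \zeta \equiv b_1 \pmod{\mathfrak{p}_1}, \quad \zeta \equiv b_2 \pmod{\mathfrak{p}_2}, \ldots$,

so that, if $a$ denotes any one of the constants, $\zeta - a$ will be divisible by at most one of
the prime ideals $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$, and never by its square. Therefore, if $\varphi(t) = \prod(t - a)$
is an integral algebraic function of the variable $t$ with constant coefficients, then
$\varphi(\zeta) = \prod(\zeta - a)$ is divisible by $\mathfrak{p}^m$ if and only if $\varphi(t)$ is algebraically divisible by
$(t - b)^m$, and when $\mathfrak{p}^m$ is the highest power of $\mathfrak{p}$ dividing $\varphi(\zeta)$, $\mathfrak{p}^{m-1}$ is the highest
power of $\mathfrak{p}$ dividing $\varphi'(\zeta)$. Consequently, $\varphi(\zeta)$ is divisible by $z - c$ only if $\varphi(t)$ is
divisible by the $n$th degree function

$\psi(t) = (t - b)^{e}(t - b_1)^{e_1}(t - b_2)^{e_2}\cdots$.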
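It follows that the congruence

$x_0 + x_1\zeta + x_2\zeta^2 + \cdots + x_{n-1}\zeta^{n-1} \equiv 0 \pmod{z - c}$

is satisfied only by those polynomial functions $x$ that are all divisible by $z - c$. Thus
if one sets

$1 = k_1^{(0)}\omega_1 + k_2^{(0)}\omega_2 + \cdots + k_n^{(0)}\omega_n$,
$\zeta = k_1^{(1)}\omega_1 + k_2^{(1)}\omega_2 + \cdots + k_n^{(1)}\omega_n$,
$\cdots$
$\zeta^{n-1} = k_1^{(n-1)}\omega_1 + k_2^{(n-1)}\omega_2 + \cdots + k_n^{(n-1)}\omega_n$,

where $k_1^{(0)}, k_2^{(0)}, \ldots$ are polynomial functions of $z$ and $\omega_1, \omega_2, \ldots, \omega_n$ is a basis of $\mathfrak{o}$,
then the determinant

$k = \sum \pm k_1^{(0)} k_2^{(1)} \cdots k_n^{(n-1)}$

neither vanishes identically nor is divisible by $z - c$ (cf. the note to §9, 1).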
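Thus it follows that

$N(t - \zeta) = f(t, z)$

is irreducible. Now since $f(\zeta, z) = 0$, so that $f(\zeta, c)$ is divisible by $z - c$, $f(t, c)$
must be divisible by $\psi(t)$. Then, since both functions are of the same degree,

$f(t, c) = \psi(t)$,

whence one concludes (for later use) that

$\mathrm{Tr}(\zeta) \equiv eb + e_1 b_1 + e_2 b_2 + \cdots \pmod{z - c}$

and, applying the same considerations to the functions $\zeta^2, \zeta^3, \ldots$, which is certainly
valid if the constants $b$ are all nonzero,

$\mathrm{Tr}(\zeta^2) \equiv eb^2 + e_1 b_1^2 + e_2 b_2^2 + \cdots \pmod{z - c}$,
$\mathrm{Tr}(\zeta^3) \equiv eb^3 + e_1 b_1^3 + e_2 b_2^3 + \cdots \pmod{z - c}$,
$\cdots$

Thus if $\mathfrak{p}^e$ is the highest power of $\mathfrak{p}$ dividing $f(\zeta, c)$, in which case $\mathfrak{p}^{e-1}$ is the
highest power dividing $f'(\zeta, c)$, one has, since

$f'(\zeta, c) \equiv f'(\zeta, z) \pmod{\mathfrak{p}^e}$,

that $\mathfrak{p}^{e-1}$ is also the highest power of $\mathfrak{p}$ dividing $f'(\zeta, z)$. This implies that

$\mathfrak{o}f'(\zeta, z) = \mathfrak{m}\,\mathfrak{p}^{e-1}\mathfrak{p}_1^{e_1-1}\mathfrak{p}_2^{e_2-1}\cdots$,

where $\mathfrak{m}$, and hence $N(\mathfrak{m})$, is relatively prime to $z - c$.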
Now if D is the discriminant of Q it follows from this, from §10, (11), and from 
§2, (13) that (apart from constant factors) 


Nf'(G,2) = AG, 6,07,-.-. 0°") = Dk? = (z- 0)" N(m), 


where s is the number of distinct prime ideals p, pi, p2,... dividing z—c. And since 
k and N(m) are not divisible by z — c, (z — c)"~* is the highest power of z — c 
dividing D. Consequently: 


(1) D=|]-0"*, 


where the product is taken over all linear expressions z — c divisible by less than n 
distinct prime factors, and hence divisible by a second or higher power of a prime 
ideal. 

Thus there are only a finite number of linear functions z —c divisible by the 
square of a prime ideal. 

We now set

(2) $\mathfrak{z} = \prod \mathfrak{p}^{e-1},$

where the product is taken over all the prime ideals $\mathfrak{p}$ whose norms are divisible by a power of
$\mathfrak{p}$ higher than the first (namely, the $e$th), and call this ideal $\mathfrak{z}$ the ramification
ideal. It follows immediately from (1) and (2) that

(3) $N(\mathfrak{z}) = D.$

Since we also have $n - s \ge e - 1$, so $e(n-s) - 2(e-1) \ge (e-1)(e-2) \ge 0$, $D$ is
divisible by $\mathfrak{p}^{2(e-1)}$, hence also by $\mathfrak{z}^2$, so we can set

(4) $\mathfrak{o}D = \mathfrak{d}\mathfrak{z}^2, \quad N(\mathfrak{d}) = D^{n-2},$

where $\mathfrak{d}$ also denotes an ideal.
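
As a hedged illustration (an example supplied here, not taken from the original paper), consider the field $\Omega$ obtained from the rational functions of $z$ by adjoining $w$ with $w^2 = z$. The sketch assumes the standard facts that $n = 2$, that $1, w$ is a basis of $\mathfrak{o}$, and that the discriminant may be computed as the determinant of the traces of products of basis elements. Formulas (1) to (4) can then be checked by hand, up to constant factors:

```latex
\begin{align*}
D &= \begin{vmatrix} \mathrm{Tr}(1) & \mathrm{Tr}(w) \\ \mathrm{Tr}(w) & \mathrm{Tr}(w^2) \end{vmatrix}
   = \begin{vmatrix} 2 & 0 \\ 0 & 2z \end{vmatrix} = 4z, \\
\mathfrak{o}z &= \mathfrak{p}^2, \quad \mathfrak{p} = \mathfrak{o}w, \quad e = 2,\ s = 1,\ n - s = 1, \\
\mathfrak{z} &= \mathfrak{p}^{e-1} = \mathfrak{p}, \quad N(\mathfrak{z}) = N(w) = -z = \mathrm{const.}\,D, \\
\mathfrak{o}D &= \mathfrak{p}^2 = \mathfrak{z}^2, \quad \text{so } \mathfrak{d} = \mathfrak{o},\ N(\mathfrak{d}) = 1 = D^{\,n-2}.
\end{align*}
```

In this example $z$ is the only linear factor of $D$, in agreement with the statement that only finitely many linear functions $z - c$ are divisible by the square of a prime ideal.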


3. If a function $\rho$ in $\mathfrak{o}$ is divisible by each prime ideal dividing $z - c$, then $\mathrm{Tr}(\rho)$
is divisible by $z - c$.
Proof. Let $\zeta$ be the same function as in 2, so that one can set:

$x\rho = x_0 + x_1\zeta + x_2\zeta^2 + \cdots + x_{n-1}\zeta^{n-1},$
where the coefficients $x, x_0, x_1, \ldots, x_{n-1}$ are polynomial functions of $z$ without common
divisor, the first of which is not divisible by $z - c$ (cf. 2). It follows from our
hypothesis on the function $\rho$, and taking the constants $b$ with the same meaning as
in 2, that

$x_0 + x_1b + x_2b^2 + \cdots + x_{n-1}b^{n-1} \equiv 0 \pmod{z-c},$
$x_0 + x_1b_1 + x_2b_1^2 + \cdots + x_{n-1}b_1^{n-1} \equiv 0 \pmod{z-c},$
$x_0 + x_1b_2 + x_2b_2^2 + \cdots + x_{n-1}b_2^{n-1} \equiv 0 \pmod{z-c},$
and this implies, multiplying the congruences by $e, e_1, e_2, \ldots$ and adding,

$x_0n + x_1\mathrm{Tr}(\zeta) + x_2\mathrm{Tr}(\zeta^2) + \cdots + x_{n-1}\mathrm{Tr}(\zeta^{n-1}) = x\,\mathrm{Tr}(\rho) \equiv 0 \pmod{z-c}.$
Therefore, since $x$ is not divisible by $z - c$,
$\mathrm{Tr}(\rho) \equiv 0 \pmod{z-c}.$
Q.E.D.


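4. Now let
$r = (z-c)(z-c_1)(z-c_2)\cdots$
be the product of all the distinct linear factors of $D$ and let
$\mathfrak{r} = \mathfrak{p}\mathfrak{p}_1\mathfrak{p}_2\cdots$


be the product of all the distinct prime ideals dividing $r$. If $\mathfrak{z}$ is the ramification
ideal as above then


(5) $\mathfrak{r}\mathfrak{z} = \prod \mathfrak{p}^{e} = \mathfrak{o}r$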


and hence 


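By 3, each function $\rho$ in $\mathfrak{r}$ has the property that $\mathrm{Tr}(\rho)$ is divisible by $r$. Now if, as
in §10,
$\mathfrak{e} = [\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n]$
is the module complementary to $\mathfrak{o}$, and if $\rho$ is an arbitrary function in $\mathfrak{r}$, then one
can set
$\rho = x_1\varepsilon_1 + x_2\varepsilon_2 + \cdots + x_n\varepsilon_n,$
where
$x_\iota = \mathrm{Tr}(\rho\omega_\iota),$
by §10, 3. Therefore, since $\rho\omega_\iota$ is a function in $\mathfrak{r}$, $x_\iota$ is a polynomial function of $z$
divisible by $r$. It follows that the ideal $\mathfrak{r}$ is divisible by the module $r\mathfrak{e}$. Hence the
ideal $D\mathfrak{r}$ is also divisible by the ideal $rD\mathfrak{e}$. At the same time,

$N(D\mathfrak{r}) = r^nD^{n-1}, \quad N(rD\mathfrak{e}) = r^nD^{n-1}$ (§10, 10),

hence by §8, 9,
$D\mathfrak{r} = rD\mathfrak{e}$

or
(6) $\mathfrak{r} = r\mathfrak{e}.$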


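It follows from this and the above remark about $\rho$ that, for any function $\varepsilon$ in $\mathfrak{e}$,
$\mathrm{Tr}(\varepsilon)$ is a polynomial function of $z$. Multiplying (6) by $\mathfrak{z}$ and applying (5) we get

$\mathfrak{r}\mathfrak{z} = r\mathfrak{e}\mathfrak{z} = \mathfrak{o}r$

and hence

(7) $\mathfrak{e}\mathfrak{z} = \mathfrak{o}.$

If one multiplies the latter equation by $D$, then it follows from (4) that
$D\mathfrak{e}\mathfrak{z} = \mathfrak{d}\mathfrak{z}^2,$

hence

(8) $D\mathfrak{e} = \mathfrak{d}\mathfrak{z},$

and multiplying the latter equation by $\mathfrak{e}$ and using (7),

(9) $D\mathfrak{e}^2 = \mathfrak{d}.$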


5.34 If $\theta$ is an integral algebraic function of $z$ in $\Omega$ and $N(t - \theta) = f(t)$, then
$f'(\theta)$ is divisible by the ramification ideal $\mathfrak{z}$.

Proof. If $f(t)$ is reducible then $f'(\theta) = 0$, hence it is certainly divisible by $\mathfrak{z}$.
Otherwise, by §10, 11

$\mathfrak{e}f'(\theta) = \mathfrak{f}$
is an ideal, hence, multiplying by $\mathfrak{z}$ and using (7),
(10) $\mathfrak{o}f'(\theta) = \mathfrak{f}\mathfrak{z}.$


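At the same time, it follows by substituting

$\theta^{h} = \sum_{\iota} k_\iota^{(h)}\omega_\iota \quad (h = 0, 1, \ldots, n-1), \qquad k = \sum \pm k_1^{(0)} k_2^{(1)} \cdots k_n^{(n-1)}$

in §10, 11 that
$N f'(\theta) = N(\mathfrak{f})N(\mathfrak{z}) = D\,N(\mathfrak{f})$
$= \mathrm{const.}\,k^2D$ (§10, (11) and §2, (13)),
so
(11) $N(\mathfrak{f}) = \mathrm{const.}\,k^2$

is a perfect square.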


34 This subsection is wrongly numbered 4 in the original. However, Dedekind and Weber have
correctly made cross references to §11, 5. (Translator's note.)
§12. The fractional functions of z in the field Ω


SUMMARY AND COMMENTS


Dedekind and Weber now extend their considerations from "integers"
of the function field $\Omega$ to arbitrary members. These may be
viewed as quotients of "integers," and indeed the denominator may
be taken to be a polynomial. (This is analogous to the situation in
algebraic number fields, where any element of the field is the quotient
of an algebraic integer by an ordinary integer. See Dedekind
(1877), §18.)

Quotients of ideals, defined for modules in general in §4, serve
to represent any principal ideal $\mathfrak{o}\eta$. If $\mathfrak{o}\eta = \mathfrak{b}/\mathfrak{a}$ then $\mathfrak{b}$ is called the
upper ideal, and $\mathfrak{a}$ the lower ideal, of the function $\eta$. The upper
and lower ideals encode the zeros and poles, respectively, of $\eta$.

If $\rho$ is a function such that $\mathfrak{o}\rho = \mathfrak{m}\mathfrak{p}/\mathfrak{n}$, where $\mathfrak{m}$ and $\mathfrak{n}$ are not
divisible by the prime ideal $\mathfrak{p}$, and if $\eta$ is a function whose lower
ideal is not divisible by $\mathfrak{p}$, then $\eta$ may be expressed as a polynomial
in $\rho$. Weber (1908), §185, in his version of the Dedekind-Weber
theory, called this the "Taylor expansion" of $\eta$.


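1. Each function $\eta$ in $\Omega$ can be expressed as a quotient of two integral algebraic
functions of $z$ (§8, 3), in infinitely many ways (the denominator can even be a
polynomial function of $z$). Suppose therefore that

$\eta = \dfrac{\nu}{\mu},$

where $\mu, \nu$ are integral algebraic functions of $z$ (functions in $\mathfrak{o}$). Now if $\mathfrak{m}$ is the
greatest common divisor of the two principal ideals $\mathfrak{o}\mu, \mathfrak{o}\nu$ then

(1) $\mathfrak{o}\mu = \mathfrak{a}\mathfrak{m}, \quad \mathfrak{o}\nu = \mathfrak{b}\mathfrak{m},$
where $\mathfrak{a}, \mathfrak{b}$ are relatively prime ideals, and hence by §4, 6,
(2) $\mathfrak{a}\nu = \mathfrak{b}\mu$ or $\mathfrak{a}\eta = \mathfrak{b}.$

Thus if $\alpha$ is any function in $\mathfrak{a}$, then $\alpha\eta$ is in $\mathfrak{b}$, hence it is also an integral
algebraic function of $z$. Conversely, if $\alpha$ is an integral algebraic function of $z$ with
the property that $\alpha\eta = \beta$ is an integral algebraic function, then

$\alpha\nu = \beta\mu$
and hence by (1)

$\alpha\mathfrak{b} = \beta\mathfrak{a}.$
Now since $\mathfrak{a}, \mathfrak{b}$ are relatively prime, $\alpha$ must be divisible by $\mathfrak{a}$, and $\beta$ by $\mathfrak{b}$, which
implies:

$\mathfrak{a}$ is the collection of all those integral algebraic functions $\alpha$ with the property
that $\alpha\eta$ is an integral algebraic function, and the collection of all these integral
algebraic functions $\alpha\eta$ is the ideal $\mathfrak{b}$; in other words:

$\mathfrak{b}$ is the least common multiple of $\mathfrak{o}\eta$ and $\mathfrak{o}$, and $\mathfrak{a}$ is the least common multiple
of $\mathfrak{o}/\eta$ and $\mathfrak{o}$. It follows that if $\mathfrak{a}', \mathfrak{b}'$ are two ideals satisfying the condition

$\mathfrak{a}'\eta = \mathfrak{b}',$
then $\mathfrak{a}'$ must be divisible by $\mathfrak{a}$. Suppose therefore that
$\mathfrak{a}' = \mathfrak{n}\mathfrak{a};$
then
$\mathfrak{b}' = \mathfrak{n}\mathfrak{a}\eta = \mathfrak{n}\mathfrak{b}.$
Conversely, for any ideal $\mathfrak{n}$,
$\mathfrak{n}\mathfrak{a}\eta = \mathfrak{n}\mathfrak{b}.$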
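2. Now let $\mathfrak{a}, \mathfrak{b}$ be two ideals satisfying the condition
$\mathfrak{a}\eta = \mathfrak{b},$
whether they are relatively prime or not. By §4, 8 the quotient $\mathfrak{b}/\mathfrak{a}$ is the collection
of all those functions $\gamma$ with the property that $\mathfrak{a}\gamma$ is divisible by $\mathfrak{b}$. These functions
certainly include all functions of the form $\omega\eta$, where $\omega$ is any function in $\mathfrak{o}$. But
conversely, each function $\gamma$ is also of this form. Since $\mathfrak{a}\gamma$ is divisible by $\mathfrak{b}$, and hence
also by $\mathfrak{o}$, it is an ideal (because it has the properties I, II of §7). Thus
$\mathfrak{a}\gamma = \mathfrak{c}\mathfrak{b},$
where $\mathfrak{c}$ is likewise an ideal and, multiplying by $\eta$,
$\mathfrak{b}\gamma = \mathfrak{c}\mathfrak{b}\eta.$
Now if $\eta = \nu/\mu$ as above, and if $\rho, \sigma$ are integral algebraic functions with $\gamma = \rho/\sigma$, then
it follows that
$\mathfrak{b}\rho\mu = \mathfrak{c}\mathfrak{b}\nu\sigma,$
hence
$\mathfrak{o}\rho\mu = \mathfrak{c}\nu\sigma, \quad \mathfrak{o}\gamma = \mathfrak{c}\eta.$
The two together yield the theorem
(3) $\mathfrak{o}\eta = \dfrac{\mathfrak{b}}{\mathfrak{a}}.$
If $\mathfrak{b}, \mathfrak{a}$ are relatively prime in this representation, which by 1 is always possible in
precisely one way, then $\mathfrak{b}$ will be called the upper ideal, and $\mathfrak{a}$ the lower ideal, of
the function $\eta$.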
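
A small illustration, supplied here and not part of the original text, may make the two ideals concrete. It assumes the simplest case in which $\Omega$ is the field of rational functions of $z$ itself, so that $\mathfrak{o}$ consists of the polynomials in $z$ and every ideal is principal; the particular function $\eta$ is chosen only for the example:

```latex
\begin{align*}
\eta &= \frac{\nu}{\mu}, \qquad \nu = z^2(z-2)(z-3), \quad \mu = (z-1)^3(z-3), \\
\mathfrak{m} &= \mathfrak{o}(z-3), \qquad \mathfrak{o}\mu = \mathfrak{a}\mathfrak{m}, \quad \mathfrak{o}\nu = \mathfrak{b}\mathfrak{m}, \\
\mathfrak{a} &= \mathfrak{o}(z-1)^3 \ \text{(lower ideal)}, \qquad \mathfrak{b} = \mathfrak{o}\,z^2(z-2) \ \text{(upper ideal)}, \\
\mathfrak{a}\eta &= \mathfrak{b}, \qquad \mathfrak{o}\eta = \frac{\mathfrak{b}}{\mathfrak{a}}.
\end{align*}
```

The upper ideal records the zeros of $\eta$ (at $z = 0$ doubly and at $z = 2$), the lower ideal records its pole (at $z = 1$, triply), and the common factor $z - 3$ disappears into $\mathfrak{m}$.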
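3. Again, if

$\mathfrak{a}\eta = \mathfrak{b},$ so that $\mathfrak{o}\eta = \dfrac{\mathfrak{b}}{\mathfrak{a}},$

and if $\alpha$ is any function in $\mathfrak{a}$ and $\beta$ any function in $\mathfrak{b}$ such that
$\eta = \dfrac{\beta}{\alpha},$ then $\mathfrak{a}\beta = \mathfrak{b}\alpha.$

It follows by taking norms that

$N(\eta) = \mathrm{const.}\,\dfrac{N(\mathfrak{b})}{N(\mathfrak{a})}.$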


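4. If $\eta, \eta'$ are two functions in $\Omega$ and if
$\mathfrak{a}\eta = \mathfrak{b}, \quad \mathfrak{a}'\eta' = \mathfrak{b}'$
as in 1, but regardless of whether $\mathfrak{a}, \mathfrak{b}$ or $\mathfrak{a}', \mathfrak{b}'$ are relatively prime, then

$\mathfrak{a}\mathfrak{a}'\eta\eta' = \mathfrak{b}\mathfrak{b}'.$

The equations

$\mathfrak{o}\eta = \dfrac{\mathfrak{b}}{\mathfrak{a}}, \quad \mathfrak{o}\eta' = \dfrac{\mathfrak{b}'}{\mathfrak{a}'}$
therefore yield the equations
$\mathfrak{o}\eta\eta' = \dfrac{\mathfrak{b}\mathfrak{b}'}{\mathfrak{a}\mathfrak{a}'}, \quad \mathfrak{o}\dfrac{\eta}{\eta'} = \dfrac{\mathfrak{b}\mathfrak{a}'}{\mathfrak{a}\mathfrak{b}'}.$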


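5. If $\mathfrak{a}\eta = \mathfrak{b}$, $\mathfrak{a}\eta' = \mathfrak{b}'$ then also

$\mathfrak{a}(\eta + \eta') = \mathfrak{b}''$

will be an ideal, because when $\alpha\eta, \alpha\eta'$ are integral algebraic functions so too is
$\alpha(\eta + \eta')$. Thus if

$\mathfrak{o}\eta = \dfrac{\mathfrak{b}}{\mathfrak{a}}, \quad \mathfrak{o}\eta' = \dfrac{\mathfrak{b}'}{\mathfrak{a}}$

then it follows that
$\mathfrak{o}(\eta + \eta') = \dfrac{\mathfrak{b}''}{\mathfrak{a}}.$

If the two ideals $\mathfrak{b}, \mathfrak{b}'$ have a common divisor, then the latter is also a divisor of $\mathfrak{b}''$.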


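6. Now let $\rho$ be a function in $\Omega$ whose upper ideal is divisible by a prime ideal
$\mathfrak{p}$, but not by $\mathfrak{p}^2$. (Such functions always exist. They can even be integral algebraic
functions of $z$.) Thus
$\mathfrak{o}\rho = \dfrac{\mathfrak{m}\mathfrak{p}}{\mathfrak{n}},$
where $\mathfrak{m}, \mathfrak{n}$ are ideals not divisible by $\mathfrak{p}$. Suppose also that $\eta$ is any function in $\Omega$
whose lower ideal is not divisible by $\mathfrak{p}$, so that
$\mathfrak{o}\eta = \dfrac{\mathfrak{b}}{\mathfrak{a}},$
where $\mathfrak{a}$ is not divisible by $\mathfrak{p}$. One chooses an arbitrary function $\alpha$ in $\mathfrak{a}$ that is not
divisible by $\mathfrak{p}$, and a corresponding function $\beta$ in $\mathfrak{b}$ so that

$\eta = \dfrac{\beta}{\alpha}.$
Let
$\alpha \equiv a_0, \quad \beta \equiv b_0 \pmod{\mathfrak{p}}, \quad c_0 = \dfrac{b_0}{a_0},$
where $a_0, b_0, c_0$ are constants, the first of which is nonzero. By 5 we have

$\mathfrak{o}(\eta - c_0) = \mathfrak{o}\,\dfrac{\beta - c_0\alpha}{\alpha} = \dfrac{\mathfrak{b}_1}{\mathfrak{a}}$

and it follows from
$\mathfrak{a}(\beta - c_0\alpha) = \mathfrak{b}_1\alpha, \quad \beta - c_0\alpha \equiv 0 \pmod{\mathfrak{p}},$

and the fact that $\mathfrak{a}$ is not divisible by $\mathfrak{p}$, that $\mathfrak{b}_1$ must be divisible by $\mathfrak{p}$. Thus if
one sets

$\eta - c_0 = \rho\eta_1,$
then the lower ideal of $\eta_1$ is also not divisible by $\mathfrak{p}$. In this way a whole series of
constants $c_0, c_1, \ldots, c_{r-1}, \ldots$ may be determined so that
$\eta = c_0 + \rho\eta_1,$
$\eta_1 = c_1 + \rho\eta_2,$
$\ldots$
$\eta_{r-1} = c_{r-1} + \rho\eta_r,$
where $\eta_1, \eta_2, \ldots, \eta_r, \ldots$ are functions whose lower ideals have no prime factors other
than those of the lower ideal of $\eta$ and the upper ideal of $\rho$, with $\mathfrak{p}$ excluded. Accordingly,
for each positive integer $r$,

$\eta = c_0 + c_1\rho + \cdots + c_{r-1}\rho^{r-1} + \eta_r\rho^{r}.$

If the lower ideal of $\zeta$ is divisible by $\mathfrak{p}^s$, but not by $\mathfrak{p}^{s+1}$, then one can apply these
considerations to the function $\eta = \zeta\rho^s$ and obtain

$\zeta = c_0\rho^{-s} + c_1\rho^{-s+1} + \cdots + c_{r-1}\rho^{-s+r-1} + \eta_r\rho^{-s+r}.$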
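
The recursion $\eta = c_0 + \rho\eta_1$, $\eta_1 = c_1 + \rho\eta_2$, and so on can be tried out in the simplest setting. The following is a minimal sketch, not the authors' procedure in general: it assumes that $\Omega$ is the field of rational functions of $z$ with rational coefficients, that $\mathfrak{p}$ is generated by $z - a$, and that $\rho = z - a$. The function names `evaluate`, `divide_by_linear` and `expand` are hypothetical helpers introduced only for this illustration.

```python
from fractions import Fraction


def evaluate(poly, a):
    """Evaluate a polynomial [c0, c1, ...] (ascending powers) at a by Horner's rule."""
    value = Fraction(0)
    for coeff in reversed(poly):
        value = value * a + coeff
    return value


def divide_by_linear(poly, a):
    """Divide poly by (z - a), assuming poly(a) == 0; return the quotient."""
    carries = []
    carry = Fraction(0)
    for coeff in reversed(poly):          # synthetic division, leading term first
        carry = carry * a + coeff
        carries.append(carry)
    remainder = carries.pop()             # the last carry equals poly(a)
    if remainder != 0:
        raise ValueError("polynomial does not vanish at a")
    return list(reversed(carries))


def expand(beta, alpha, a, r):
    """Constants c_0..c_{r-1} of eta = beta/alpha at z = a, plus the numerator of eta_r.

    Assumes alpha(a) != 0, i.e. the lower ideal of eta is not divisible by p.
    """
    a = Fraction(a)
    beta = [Fraction(x) for x in beta]
    alpha = [Fraction(x) for x in alpha]
    alpha_0 = evaluate(alpha, a)
    if alpha_0 == 0:
        raise ValueError("the lower ideal of eta must not be divisible by p")
    constants = []
    for _ in range(r):
        c = evaluate(beta, a) / alpha_0   # c_k = b_0 / a_0
        size = max(len(beta), len(alpha))
        # beta - c*alpha vanishes at a, so it is divisible by rho = z - a
        diff = [
            (beta[i] if i < len(beta) else 0) - c * (alpha[i] if i < len(alpha) else 0)
            for i in range(size)
        ]
        beta = divide_by_linear(diff, a)  # numerator of the next eta_k
        constants.append(c)
    return constants, beta


# eta = 1/(1 - z) expanded at a = 0 with rho = z
constants, remainder_numerator = expand([1], [1, -1], 0, 4)
```

Under these assumptions the call at the end expands $1/(1-z)$ in powers of $z$, and the returned numerator, divided by the unchanged denominator $\alpha$, is the remainder function $\eta_r$.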


§13. Rational transformations of functions in the field Ω


SUMMARY AND COMMENTS


It was shown in §2 that any nonconstant function $z_1 \in \Omega$ satisfies
a polynomial equation whose coefficients are rational functions
of $z$. It follows that there is a polynomial equation relating $z$ and
$z_1$. This suggests that $z_1$ may serve as an "independent variable"
for the functions in $\Omega$ just as well as $z$, and indeed this is the case.

If $\theta$ is a function such that $1, \theta, \theta^2, \ldots, \theta^{n-1}$ is a basis of $\Omega$ (relative
to $z$), then we can find a function $\theta_1$ such that $1, \theta_1, \theta_1^2, \ldots, \theta_1^{n_1-1}$
is a basis of $\Omega$ relative to $z_1$. Moreover, $z_1, \theta_1$ can be expressed
rationally in terms of $z, \theta$, and vice versa. This is the "rational
transformation" referred to in the title of the section, though it might
be better called (as it is today) a birational transformation.

The section concludes with the warning that "the concepts of
basis, norm, trace, ..., ideal depend essentially on the choice of
independent variable," except when "the two variables $z, z_1$ depend
linearly on each other." The latter sense of "linear" is that
$z_1 = \dfrac{az+b}{cz+d}$ for some numbers $a, b, c, d$ with $ad - bc \ne 0$.


If $z_1$ is an arbitrary nonconstant function in the field $\Omega$ (a variable in $\Omega$) then,
as we have shown in §2, there is an irreducible algebraic equation relating $z_1$ and
$z$, which, when cleared of denominators, is of degree $e$ in $z_1$ and degree $e_1$ in $z$. As
was also shown there, $e$ is a divisor of $n$, $n = ef$. Let this equation be
(1) $G(z_1, z) = 0.$

With the help of this equation, each rational function $\zeta$ of $z$ and $z_1$ may (§1) be
brought into the two forms

(2) $\zeta = x_0 + x_1z_1 + \cdots + x_{e-1}z_1^{e-1},$
$\zeta = x_0^{(1)} + x_1^{(1)}z + \cdots + x_{e_1-1}^{(1)}z^{e_1-1},$

and indeed uniquely, where $x_0, x_1, \ldots, x_{e-1}$ are rational functions of $z$ and $x_0^{(1)}, x_1^{(1)},
\ldots, x_{e_1-1}^{(1)}$ are rational functions of $z_1$.
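Now if $\theta$ is a function such that $1, \theta, \theta^2, \ldots, \theta^{n-1}$ is a basis35 of $\Omega$ (relative to
$z$), then by §2 the $n$ functions

(3)
$1, \; z_1, \; \ldots, \; z_1^{e-1},$
$\theta, \; \theta z_1, \; \ldots, \; \theta z_1^{e-1},$
$\ldots$
$\theta^{f-1}, \; \theta^{f-1}z_1, \; \ldots, \; \theta^{f-1}z_1^{e-1}$

likewise form such a basis, and hence it follows from (2) that the $e_1f = n_1$ functions

(4)
$1, \; z, \; z^2, \; \ldots, \; z^{e_1-1},$
$\theta, \; \theta z, \; \theta z^2, \; \ldots, \; \theta z^{e_1-1},$
$\ldots$
$\theta^{f-1}, \; \theta^{f-1}z, \; \theta^{f-1}z^2, \; \ldots, \; \theta^{f-1}z^{e_1-1},$

which we abbreviate by

$\eta_1^{(1)}, \eta_2^{(1)}, \ldots, \eta_{n_1}^{(1)},$

satisfy an equation of the form
$x_1^{(1)}\eta_1^{(1)} + x_2^{(1)}\eta_2^{(1)} + \cdots + x_{n_1}^{(1)}\eta_{n_1}^{(1)} = 0$

only if the rational functions $x_1^{(1)}, x_2^{(1)}, \ldots, x_{n_1}^{(1)}$ of $z_1$ all vanish. It then follows by
(2) again that each function $\eta$ in $\Omega$ is representable uniquely in the form

$\eta = x_1^{(1)}\eta_1^{(1)} + x_2^{(1)}\eta_2^{(1)} + \cdots + x_{n_1}^{(1)}\eta_{n_1}^{(1)},$

where the $x^{(1)}$ are rational functions of $z_1$.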
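
A worked instance, added here as an illustration and not taken from the original, shows how the degrees $e$, $e_1$, $f$ and $n_1$ fit together. It assumes the field $\Omega$ generated over the rational functions of $z$ by $\theta$ with $\theta^2 = z$, and the new variable $z_1 = \theta$:

```latex
\begin{align*}
&\theta^2 = z, \quad n = 2, \quad z_1 = \theta, \\
&G(z_1, z) = z_1^2 - z = 0, \quad e = 2 \ (\text{degree in } z_1), \quad e_1 = 1 \ (\text{degree in } z), \\
&n = ef \ \Rightarrow\ f = 1, \qquad n_1 = e_1 f = 1, \\
&\text{functions (3): } 1,\ z_1 \quad (\text{a basis of } \Omega \text{ relative to } z), \\
&\text{functions (4): } 1 \quad (\text{a basis of } \Omega \text{ relative to } z_1).
\end{align*}
```

Every function in this $\Omega$ is therefore a rational function of $z_1$ alone, which agrees with the substitution $z = z_1^2$, $\theta = z_1$. The field has degree 2 relative to $z$ but degree 1 relative to $z_1$.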
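Each such function $\eta$ satisfies an algebraic equation of degree $n_1$ whose coefficients
depend rationally on $z_1$, because

$\eta\,\eta_1^{(1)} = x_{1,1}^{(1)}\eta_1^{(1)} + x_{1,2}^{(1)}\eta_2^{(1)} + \cdots + x_{1,n_1}^{(1)}\eta_{n_1}^{(1)},$
$\eta\,\eta_2^{(1)} = x_{2,1}^{(1)}\eta_1^{(1)} + x_{2,2}^{(1)}\eta_2^{(1)} + \cdots + x_{2,n_1}^{(1)}\eta_{n_1}^{(1)},$
$\ldots$
$\eta\,\eta_{n_1}^{(1)} = x_{n_1,1}^{(1)}\eta_1^{(1)} + x_{n_1,2}^{(1)}\eta_2^{(1)} + \cdots + x_{n_1,n_1}^{(1)}\eta_{n_1}^{(1)},$
and hence
$\begin{vmatrix} x_{1,1}^{(1)} - \eta & x_{1,2}^{(1)} & \cdots & x_{1,n_1}^{(1)} \\ x_{2,1}^{(1)} & x_{2,2}^{(1)} - \eta & \cdots & x_{2,n_1}^{(1)} \\ \vdots & \vdots & & \vdots \\ x_{n_1,1}^{(1)} & x_{n_1,2}^{(1)} & \cdots & x_{n_1,n_1}^{(1)} - \eta \end{vmatrix} = 0.$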

It may now be shown that the function $\eta = \theta_1$ can be chosen so that $\theta_1$ satisfies
no equation of lower degree whose coefficients depend rationally on $z_1$.
To prove this assertion we rely on the following theorem, which is easily proved
by induction on $m$. If
$F(x_1, x_2, \ldots, x_m)$
is a polynomial function of $x_1, x_2, \ldots, x_m$ whose coefficients are functions in $\Omega$, not
all vanishing, then one can replace $x_1, x_2, \ldots, x_m$ by constants or rational functions


35 One could take any other basis of $\Omega$ in place of $1, \theta, \ldots, \theta^{n-1}$ for this argument. However,
it suits our purpose to choose this particular one.
of $z_1$ in such a way that $F$ becomes a nonvanishing function in $\Omega$. Because if
$F(x_1, x_2, \ldots, x_m)$ is zero for all such $x_1, x_2, \ldots, x_m$ then also

$dF = F'(x_1)\,dx_1 + F'(x_2)\,dx_2 + \cdots + F'(x_m)\,dx_m = 0$

for $dx_1, dx_2, \ldots, dx_m$ which are arbitrary constants or rational functions of $z_1$. Now
if
$\theta_1 = x_1\eta_1^{(1)} + x_2\eta_2^{(1)} + \cdots + x_{n_1}\eta_{n_1}^{(1)},$
and

(5)
$1 = x_{1,0}\eta_1^{(1)} + x_{2,0}\eta_2^{(1)} + \cdots + x_{n_1,0}\eta_{n_1}^{(1)},$
$\theta_1 = x_{1,1}\eta_1^{(1)} + x_{2,1}\eta_2^{(1)} + \cdots + x_{n_1,1}\eta_{n_1}^{(1)},$
$\ldots$
$\theta_1^{m} = x_{1,m}\eta_1^{(1)} + x_{2,m}\eta_2^{(1)} + \cdots + x_{n_1,m}\eta_{n_1}^{(1)},$

then the $x_{\iota,h}$ are homogeneous polynomials of degree $h$ in $x_1, x_2, \ldots, x_{n_1}$, and hence
they depend rationally on $z_1$.
Thus if
$\varphi(\theta_1) = a_m\theta_1^{m} + a_{m-1}\theta_1^{m-1} + \cdots + a_1\theta_1 + a_0 = 0$
is the equation of lowest degree satisfied by $\theta_1$ with coefficients depending rationally
on $z_1$, then the functions $a_0, a_1, \ldots, a_m$ satisfy the conditions

$a_0x_{\iota,0} + a_1x_{\iota,1} + \cdots + a_mx_{\iota,m} = 0 \quad (\iota = 1, 2, \ldots, n_1)$

and $m \le n_1$. Now since not all the $m \times m$ determinants formed from the coefficients
$x_{\iota,k}$ can vanish (otherwise $\theta_1$ would satisfy an equation of degree less than $m$) it
follows from the latter equations that one can assume the $a_0, a_1, \ldots, a_m$ to be
homogeneous polynomial functions of $x_1, x_2, \ldots, x_{n_1}$.

Now if the equation $\varphi(\theta_1) = 0$ is to hold for all $x_1, x_2, \ldots, x_{n_1}$ depending
rationally on $z_1$, the above theorem shows that we must also have

$d\varphi = \varphi'(\theta_1)\,d\theta_1 + da_m\,\theta_1^{m} + \cdots + da_1\,\theta_1 + da_0 = 0,$

and when $m < n_1$ the $dx_1, dx_2, \ldots, dx_{n_1}$ may be determined, without all vanishing,
so that

$da_m : da_{m-1} : \cdots : da_1 : da_0 = a_m : a_{m-1} : \cdots : a_1 : a_0$
and hence

$\varphi'(\theta_1)\,d\theta_1 = 0.$
But since $\varphi'(\theta_1)$ is of degree $m - 1$ we must have $d\theta_1 = 0$, hence $dx_1 = 0$, $dx_2 = 0$,
..., $dx_{n_1} = 0$. Therefore we can only have $m = n_1$.
Thus, if $\theta_1$ is determined so that the equation of lowest degree

$F_1(\theta_1, z_1) = 0$

really attains the degree $n_1$, then all functions in $\Omega$ are uniquely representable in
the form
$x_0^{(1)} + x_1^{(1)}\theta_1 + \cdots + x_{n_1-1}^{(1)}\theta_1^{n_1-1},$

where the coefficients $x_0^{(1)}, x_1^{(1)}, \ldots, x_{n_1-1}^{(1)}$ depend rationally on $z_1$. Because under
this assumption one can represent the $\eta_1^{(1)}, \eta_2^{(1)}, \ldots, \eta_{n_1}^{(1)}$ in this way by means of the
equations (5).

Thus not only can $z_1, \theta_1$ be expressed rationally in terms of $z, \theta$ but, conversely,
$z, \theta$ can be expressed rationally in terms of $z_1, \theta_1$.
The variable $z$ that we previously took to be independent can therefore be any
(nonconstant) function in the field $\Omega$. However, while the collection of all functions
in the field $\Omega$ remains completely unaltered, the concepts of basis, norm, trace,
discriminant, integral algebraic function, module and ideal depend essentially on
the choice of independent variable $z$.

Only in the special case where the two variables $z, z_1$ depend linearly on each
other is a basis of $\mathfrak{o}$ relative to $z$ also a basis of $\mathfrak{o}$ relative to $z_1$. Similarly, the
norms, traces and discriminants are also the same for $z$ and $z_1$ in this case.

If $\alpha, \beta$ are any two functions in $\Omega$ then they are related by polynomial equations
in $\alpha$ and $\beta$.

Among these there is one,

$F(\alpha, \beta) = 0,$
that by §1 has lowest possible degree in both $\alpha$ and $\beta$, and this will be called the
irreducible equation relating $\alpha$ and $\beta$. It is unique up to a constant factor.


https://doi.org/10.1090/hmath/039/05 
§14. The points of the Riemann surface


SUMMARY AND COMMENTS


As Dedekind and Weber point out, up to this stage algebraic
functions have been treated merely as elements of a field $\Omega$, and
their numerical values have not been considered. Of course, a function
$\alpha$ acquires a value $\alpha(P)$ when we evaluate $\alpha$ at the point $P$
(which we expect to be a point of a Riemann surface). But as
yet we do not have any "points." Dedekind and Weber had the
inspiration to define points in terms of the function field $\Omega$.

Since functions acquire values from a point, a point $\mathfrak{P}$ corresponds
to an evaluation of functions, that is, an assignment of
values $(\alpha)_0$ to the functions $\alpha \in \Omega$ that is consistent in the following
sense:

(I) $\alpha_0 = \alpha$ if $\alpha$ is constant, and in general
(II) $(\alpha + \beta)_0 = \alpha_0 + \beta_0$, (IV) $(\alpha\beta)_0 = \alpha_0\beta_0$,
(III) $(\alpha - \beta)_0 = \alpha_0 - \beta_0$, (V) $(\alpha/\beta)_0 = \alpha_0/\beta_0$.

Since we are dealing with a field, the value $1/0$ is going to come
up, so we include $\infty$ among the possible values.

There is an immediate connection with prime ideals; namely,
the integral algebraic functions that vanish at a point $\mathfrak{P}$ form a
prime ideal $\mathfrak{p}$. Conversely, each prime ideal occurs as the set of
functions vanishing at some point $\mathfrak{P}$.

These results depend on the choice of independent variable $z$
but, with finitely many exceptions, the points $\mathfrak{P}$ are in one-to-one
correspondence with the prime ideals $\mathfrak{p}$ of $\Omega$. When finitely many
other points $\mathfrak{P}'$ are added (corresponding to prime ideals $\mathfrak{p}'$ for the
variable $z' = 1/z$) one has all the points of the Riemann surface
for $\Omega$.

One could say that the prime ideals of $\Omega$ correspond to all points
of the Riemann surface except its "points at infinity." However,
the points at infinity are important, so Dedekind and Weber in
the next section introduce their concept of "polygons" (now called
"divisors") in order to give a better accounting of points at infinity.

It should be mentioned that, at this stage, the Riemann surface
for $\Omega$ is just a cloud of points without obvious structure. Some
structure will be extracted algebraically later, but not the "continuity"
or connectedness of this set of points. Because of the generality
of the Dedekind-Weber theory, which also applies to discrete function
fields, this is unavoidable. But it means that the usual approach
to differential quotients and differentials, via limits, is not available.
Instead, Dedekind and Weber show in §23 that differentials can also
be defined algebraically.


The previous considerations of the functions in the field $\Omega$ were of a purely formal
nature. All results were rational, i.e., consequences of the irreducible equation
relating the two functions in $\Omega$ by the four operations of algebra. The numerical
values of these functions did not come under consideration. One could even push
the formal treatment further, without invoking other principles, by regarding two
functions in the field $\Omega$ as independent variables instead of being related by an
equation, in which case everything boils down to algebraic divisibility of rational
functions of two variables. We have also worked out this approach, but it is very
heavy going and offers no advantage in rigor over the way we have taken above.
Now that we have carried the formal part of the investigation this far, the most
pressing question is: to what extent is it possible to assign numerical values to the
functions in $\Omega$ so that all rational relations (identities) between these functions become
correct equations between numbers? For this investigation it turns out to be
useful to regard infinity as one definite (constant) number $\infty$, subject to definite
rules of calculation.36 When the domain of numbers is extended in this way the
arithmetic operations always lead to a definite result as long as none of the terms
$\infty \pm \infty$, $0 \cdot \infty$, $\frac{0}{0}$, $\frac{\infty}{\infty}$ occurs in the course of the calculation. The appearance of
such indeterminacy in an equation is not to be regarded as a contradiction, since
in this case the equation no longer expresses a definite assertion, and hence cannot
be said to be true or untrue. The functions in the field $\Omega$ include not only infinitely
many variables but also all the constants, i.e., numbers. Thus in trying to satisfy
the above demand we arrive at the following concept.


1. Definition. When all individual elements $\alpha, \beta, \gamma, \ldots$ of the field $\Omega$ are replaced
by definite numerical values $\alpha_0, \beta_0, \gamma_0, \ldots$ in such a way that

(I) $\alpha_0 = \alpha$ if $\alpha$ is constant, and in general

(II) $(\alpha + \beta)_0 = \alpha_0 + \beta_0$, (IV) $(\alpha\beta)_0 = \alpha_0\beta_0$,

(III) $(\alpha - \beta)_0 = \alpha_0 - \beta_0$, (V) $(\alpha/\beta)_0 = \alpha_0/\beta_0$,
then such an assignment of definite values will be associated with a point $\mathfrak{P}$ (which
one may represent by any point in space),37 and we say that $\alpha = \alpha_0$ at $\mathfrak{P}$, or that
$\alpha$ has the value $\alpha_0$ at $\mathfrak{P}$. Two points are called different if and only if there is a
function $\alpha$ in $\Omega$ to which they assign different values.

From this definition of point we shall now deduce existence, as well as the
scope of the concept. However, it should first be emphasized that this definition of
"point" is an invariant concept for the field $\Omega$, in no way depending on the choice
of independent variables used to represent the functions in the field.
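
The rules of calculation with the single value $\infty$, on which conditions (II) to (V) rely, can be sketched in code. This is a minimal illustration under assumptions made here, not a construction from the original: values are restricted to rational numbers plus a marker `INF`, and the names `INF`, `Indeterminate`, `add`, `sub`, `mul` and `div` are hypothetical. The four indeterminate forms raise an error instead of returning a value, mirroring the remark that such an equation "no longer expresses a definite assertion."

```python
from fractions import Fraction

INF = "oo"  # hypothetical marker for the single value infinity


class Indeterminate(ArithmeticError):
    """Raised for oo +- oo, 0 * oo, 0/0 and oo/oo, which have no definite value."""


def add(x, y):
    if x == INF and y == INF:
        raise Indeterminate("oo + oo")
    return INF if INF in (x, y) else x + y


def sub(x, y):
    if x == INF and y == INF:
        raise Indeterminate("oo - oo")
    return INF if INF in (x, y) else x - y


def mul(x, y):
    if (x == INF and y == 0) or (y == INF and x == 0):
        raise Indeterminate("0 * oo")
    return INF if INF in (x, y) else x * y


def div(x, y):
    if x == INF and y == INF:
        raise Indeterminate("oo / oo")
    if y == INF:
        return 0                      # finite / oo
    if x == INF:
        return INF                    # oo / finite, including oo / 0
    if y == 0:
        if x == 0:
            raise Indeterminate("0 / 0")
        return INF                    # nonzero / 0
    return Fraction(x) / Fraction(y)


def is_definite(operation, x, y):
    """Return True when the operation yields a definite value for x and y."""
    try:
        operation(x, y)
    except Indeterminate:
        return False
    return True
```

With these rules, a value assignment satisfying (I) to (V) only has to be checked on those combinations for which `is_definite` returns `True`.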


36 Regarding infinity as a definite value is common and useful in function theory. This was
done, e.g., by Riemann in order to be able to consider his surfaces representing algebraic functions
as closed.

37 A geometric representation of the "point" is by no means necessary, and does not make
comprehension much easier. It suffices to regard the word "point" as a short and convenient
expression for the coexistence of values just described.
2. Theorem. If a point $\mathfrak{P}$ is given, and $z$ is a variable in $\Omega$ that is finite at $\mathfrak{P}$
(such a variable exists for every point, because if $z_0 = \infty$ then $(1/z)_0 = 0$ is finite),
then each integral algebraic function $\omega$ of $z$ has a finite value $\omega_0$ at $\mathfrak{P}$. This is
because between $\omega$ and $z$ there is a relation of the form

$1 = a\dfrac{1}{\omega} + b\dfrac{1}{\omega^2} + \cdots + k\dfrac{1}{\omega^m},$

where $a, b, \ldots, k$ are polynomial functions of $z$ and hence, by (II), (III), (IV),
have finite values at $\mathfrak{P}$. Consequently, $(1/\omega)_0$ is nonzero, and hence $\omega_0$ cannot equal
$\infty$.


3. Theorem. If $z$ is a variable that is finite at $\mathfrak{P}$ then the set $\mathfrak{p}$ of all the integral
algebraic functions $\pi$ of $z$ that vanish at $\mathfrak{P}$ is a prime ideal in $z$. We say that the
point $\mathfrak{P}$ generates this prime ideal $\mathfrak{p}$. If $\omega$ is an integral algebraic function of $z$ that
has the value $\omega_0$ at $\mathfrak{P}$ then $\omega \equiv \omega_0 \pmod{\mathfrak{p}}$.

Proof. If $\pi'_0 = 0$ and $\pi''_0 = 0$ then also $(\pi' + \pi'')_0 = \pi'_0 + \pi''_0 = 0$, and if $\omega$ is any integral
algebraic function of $z$, so that $\omega_0$ is finite, then $\pi_0 = 0$ implies $(\omega\pi)_0 = \omega_0\pi_0 = 0$.
Thus $\mathfrak{p}$ is an ideal in $z$ (§7, I, II). The ideal $\mathfrak{p}$ is different from $\mathfrak{o}$, since it does not
contain the function "1."

If $\omega$ has the value $\omega_0$ at $\mathfrak{P}$, then $(\omega - \omega_0)_0 = 0$, hence $\omega \equiv \omega_0 \pmod{\mathfrak{p}}$, and so
each integral algebraic function of $z$ is congruent to a constant modulo $\mathfrak{p}$. Therefore
(§9, 7) $\mathfrak{p}$ is a prime ideal.


4. Theorem. The same prime ideal $\mathfrak{p}$ cannot be generated by two different
points.

Firstly, the value of each integral algebraic function $\omega$ at a point $\mathfrak{P}$ generating
the ideal $\mathfrak{p}$ is determined by the congruence $\omega \equiv \omega_0 \pmod{\mathfrak{p}}$. But if $\eta$ is an
arbitrary function in $\Omega$ then it is possible, by §12, 1, to determine two integral
algebraic functions $\alpha, \beta$, not both divisible by $\mathfrak{p}$, such that

$\eta = \dfrac{\alpha}{\beta}.$

Now, since the finite values $\alpha_0, \beta_0$ do not both vanish, it follows from (V) that
$\eta_0 = \dfrac{\alpha_0}{\beta_0}$

is likewise determined by $\mathfrak{p}$.

It follows that two points at which a variable $z$ has finite values are different if
and only if there is an integral algebraic function of $z$ taking different values at the
points.


5. Theorem. If $z$ is any variable in $\Omega$ and $\mathfrak{p}$ is a prime ideal in $z$ then there is
one (and, by 4, only one) point $\mathfrak{P}$ that generates this prime ideal, and which will
be called the null point of the ideal $\mathfrak{p}$.

Proof. Let $\eta$ be an arbitrary function in $\Omega$, and let $\rho$ be one whose upper ideal
is divisible by $\mathfrak{p}$ but not by $\mathfrak{p}^2$. Then by §12, 6 there is a unique way to choose an
integer $m$, a nonzero finite constant $c$, and a function $\eta_1$, whose lower ideal is not
divisible by $\mathfrak{p}$, so that

$\eta = c\rho^{m} + \eta_1\rho^{m+1}.$
We set
$\eta_0 = 0, \; c, \; \infty$
according as $m$ is positive, zero, or negative. This assignment of values to the
functions in the field $\Omega$ corresponds to a point $\mathfrak{P}$, since conditions (I) to (V) are
satisfied, as one sees immediately.38

Each function whose upper ideal is divisible by $\mathfrak{p}$, in particular, each function in
$\mathfrak{p}$, receives the value zero at $\mathfrak{P}$ by this assignment, i.e., the point $\mathfrak{P}$ thus determined
generates the prime ideal $\mathfrak{p}$.

The functions whose lower ideal is divisible by $\mathfrak{p}$, and only such functions, have
the value $\infty$ at $\mathfrak{P}$. This means that an integral algebraic function of $z$ is infinite at
no point where $z$ has a finite value and, since a fractional function of $z$ certainly
has one prime ideal in its lower ideal, and hence at least one point where $z$ is finite
and the function value is infinite, it follows conversely that each function with no
infinite values for finite values of $z$ is an integral algebraic function of $z$.


6. From 3, 4, 5 we obtain the following result. In order to obtain all actual
points $\mathfrak{P}$ exactly once, one takes an arbitrary variable $z$ in the field $\Omega$. If one
constructs all the prime ideals $\mathfrak{p}$ in $z$ and the null point for each, then one obtains
all the points $\mathfrak{P}$ at which $z$ remains finite. If $\mathfrak{P}'$ is a point different from these, then
$z' = \frac{1}{z}$ has the finite value zero at $\mathfrak{P}'$. Conversely, each point $\mathfrak{P}'$ at which $z'$ has the
value zero is different from the points $\mathfrak{P}$. The prime ideal $\mathfrak{p}'$ in $z'$ generated by such
a point $\mathfrak{P}'$ (which consists of all integral algebraic functions of $z'$ vanishing at $\mathfrak{P}'$)
divides $z'$, and conversely, the null point of each of the prime ideals $\mathfrak{p}'$ in $z'$ that divide $z'$
is a point $\mathfrak{P}'$ at which $z' = 0$ and hence $z = \infty$. This finite number of extra points
corresponding to the different $\mathfrak{p}'$, together with those previously derived from the
prime ideals $\mathfrak{p}$ in $z$, exhausts the collection of all points $\mathfrak{P}$ making up the Riemann
surface $T$.


§15. The order numbers 


SUMMARY AND COMMENTS 


This section sets up the concepts needed to discuss zeros and
poles of algebraic functions. Zeros of a function $\rho$ are described as
points $\mathfrak{P}$ where $\rho$ becomes "infinitely small," and the order of the
zero is the highest power of the ideal $\mathfrak{p}$, generated by $\mathfrak{P}$, dividing
the upper ideal of $\rho$.

Similarly, poles of a function $\eta$ are points where $\eta$ "becomes
infinite," i.e., where $1/\eta$ becomes infinitely small. If this zero of
$1/\eta$ has order $r$, then the pole of $\eta$ is assigned order $-r$. At each
point $\mathfrak{P}$, the assignment of order numbers to functions $\eta \in \Omega$ is an
example of what we now call a discrete valuation on $\Omega$.

The possibility of describing a function by its zeros and poles,
and their orders, leads Dedekind and Weber to introduce formal


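38 If $\eta' = c'\rho^{m'} + \eta'_1\rho^{m'+1}$ then e.g.,

$\dfrac{\eta}{\eta'} = \dfrac{c}{c'}\rho^{m-m'} + \eta''\rho^{m-m'+1},$

where
$\eta'' = \dfrac{c'\eta_1 - c\eta'_1}{c'(c' + \rho\eta'_1)}$

is a function of the same kind as $\eta_1$ (the proof is even simpler in the other cases).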
products of points that they call polygons. Thus a polygon looks
like

$\mathfrak{P}^{r}\mathfrak{P}_1^{r_1}\mathfrak{P}_2^{r_2}\cdots,$
where $\mathfrak{P}, \mathfrak{P}_1, \mathfrak{P}_2, \ldots$ are points and $r, r_1, r_2, \ldots$ are positive integers.
Today, these objects are called (positive) divisors and they
are usually written additively:

$r\mathfrak{P} + r_1\mathfrak{P}_1 + r_2\mathfrak{P}_2 + \cdots.$

The multiplicative notation has the advantage that the greatest
common divisor and least common multiple can be defined in the
obvious way.

It is likewise the case that polygons have straightforward multiplication
and divisibility. As Dedekind and Weber say, in subsection
6, "the laws of divisibility of polygons agree completely with those
for integers and ideals."

Polygons are natural for the description of zeros, but not for
poles, because the exponents $r, r_1, r_2, \ldots$ are required to be positive.
This is in line with the privileged role of ideals in the Dedekind-Weber
theory: a polygon $\mathfrak{A} = \mathfrak{P}^{r}\mathfrak{P}_1^{r_1}\mathfrak{P}_2^{r_2}\cdots$ generates the ideal
$\mathfrak{a}$ of integral algebraic functions that vanish at $\mathfrak{P}, \mathfrak{P}_1, \mathfrak{P}_2, \ldots$ with
orders $r, r_1, r_2, \ldots$ respectively. Some amendment is nevertheless required to
take poles into account. This is done in §17 by means of quotients
$\mathfrak{A}/\mathfrak{B}$ of polygons.

Finally, this section takes the first step towards extracting the
"surface" structure from the Riemann surface, which until now has
been merely a structureless set of "points." For a field $\Omega$ of degree
$n$ relative to $z$ (in which case $z$ is said to be of order $n$) it is shown
that $z$ takes each value $c$ at exactly $n$ points. Thus the points of the
Riemann surface for $\Omega$ arrange themselves into $n$ "sheets" covering
the "Riemann sphere" $\mathbb{C} \cup \{\infty\}$ of values $c$.


1. Definition. If $\mathfrak{P}$ is a particular point, we consider all functions $\pi$ in $\Omega$
vanishing at $\mathfrak{P}$ and assign each of them an order number as follows.
Such a function $\rho$ has order number 1, or is said to be infinitely small of first
order or $0^1$ at $\mathfrak{P}$ when all quotients $\frac{\pi}{\rho}$ remain finite at $\mathfrak{P}$. If $\rho'$ is a function like
$\rho$, then $\frac{\rho}{\rho'}$ is neither 0 nor $\infty$ at $\mathfrak{P}$. Conversely, if $\frac{\rho}{\rho'}$ is neither 0 nor $\infty$ at $\mathfrak{P}$ then
$\rho'$ is likewise infinitely small of first order. Moreover, if for any function $\pi$ there is
a positive integer exponent $r$ such that $\frac{\pi}{\rho^r}$ is neither 0 nor $\infty$ at $\mathfrak{P}$, then the same
holds for $\frac{\pi}{\rho'^r}$ and $\pi$ receives the order number $r$, or is said to be infinitely small of
order $r$ at the point $\mathfrak{P}$. We shall also say that $\pi$ is $0^r$ at $\mathfrak{P}$ or $\pi$ is 0 at $\mathfrak{P}^r$.

In order to decide the question of the existence of such functions $\rho$ and such
order numbers $r$, one takes any variable $z$ that is finite at $\mathfrak{P}$, lets $\mathfrak{p}$ denote the prime
ideal in $z$ generated by $\mathfrak{P}$, and represents each function $\pi$ (with the exception of
the orderless constant 0) by §12 as the quotient of two relatively prime ideals in
$z$. The upper ideal of each of these functions is then divisible by $\mathfrak{p}$, and there will
also be such functions whose upper ideal is not divisible by $\mathfrak{p}^2$. The latter have the
order number 1. For the remaining functions $\pi$ the order number is the exponent
of the highest power of $\mathfrak{p}$ dividing the upper ideal, as follows immediately from the
theorems of §12.


2. If a function $\eta$ has the finite value $\eta_0$ at $\mathfrak{P}$ then we say that $\eta$ has this value
$r$-tuply at $\mathfrak{P}$ or at $r$ points coinciding with $\mathfrak{P}$, or at $\mathfrak{P}^r$, when the function $\eta - \eta_0$
is infinitely small of order $r$ at $\mathfrak{P}$. However, if $\eta_0 = \infty$ we say that $\eta$ has the value
$\infty$ $r$-tuply at $\mathfrak{P}$, or at $r$ points coinciding with $\mathfrak{P}$, or that $\eta$ is $\infty^r$ at $\mathfrak{P}$ or $\infty$ at
$\mathfrak{P}^r$, when $\frac{1}{\eta}$ vanishes at $\mathfrak{P}^r$.


3. If a function $\eta$ becomes $\infty^r$ at $\mathfrak{P}$ then we assign it the order number $-r$,
but when $\eta$ is neither 0 nor $\infty$ at $\mathfrak{P}$ it is assigned order number 0. It follows that
each function in the field $\Omega$ has a definite order number at an arbitrary point $\mathfrak{P}$,
with the exception of the two constant functions 0 and $\infty$.


4. If $\rho$ is a function that has order number 1 at a point $\mathfrak{P}$, and if $\eta$ is a function
with the (positive, negative or vanishing) order number $m$, then the concluding
theorem of §12 makes it possible to determine, for each positive $r$, a series of
constants $c_0, c_1, \ldots, c_{r-1}$, the first of which does not vanish, and a function $\sigma$ that
is finite at $\mathfrak{P}$, so that

$\eta = c_0\rho^{m} + c_1\rho^{m+1} + \cdots + c_{r-1}\rho^{m+r-1} + \sigma\rho^{m+r}.$

5. It follows immediately that the order number of a product of two or more 
functions equals the sum of the order numbers of the individual factors. 

The order number of a quotient of two functions equals the order number of 
the numerator minus that of the denominator. 

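If $\eta_1, \eta_2, \ldots, \eta_s$ is a sequence of functions and $m$ is the algebraically smallest of
their order numbers then

$$\eta_1 = e_1\rho^{m} + \sigma_1\rho^{m+1}, \quad \eta_2 = e_2\rho^{m} + \sigma_2\rho^{m+1}, \quad \ldots, \quad \eta_s = e_s\rho^{m} + \sigma_s\rho^{m+1},$$

where the constants $e_1, e_2, \ldots, e_s$ do not all vanish (those $\eta_i$ whose order number
exceeds $m$ have $e_i = 0$). Hence, if $c_1, c_2, \ldots, c_s$ are
constants, the order number of

$$\eta = c_1\eta_1 + c_2\eta_2 + \cdots + c_s\eta_s$$

is $m$ if $c_1e_1 + c_2e_2 + \cdots + c_se_s$ is nonzero; otherwise it is greater than $m$.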
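
The rules in 3 and 5 can be checked by hand in the simplest case. The following is a minimal sketch, assuming the field $\Omega = \mathbb{C}(z)$ of rational functions (degree $n = 1$), whose points may be identified with the values of $z$ in $\mathbb{C} \cup \{\infty\}$. The helper names `root_multiplicity`, `order_number` and `poly_mul` are hypothetical and introduced only for this illustration; polynomials are lists of rational coefficients, lowest degree first.

```python
from fractions import Fraction

def root_multiplicity(coeffs, a):
    """Multiplicity of a as a root of the polynomial sum(coeffs[k] * z**k).

    Assumes a nonzero polynomial whose last (leading) coefficient is nonzero.
    """
    coeffs = [Fraction(c) for c in coeffs]
    mult = 0
    while len(coeffs) > 1:
        # Synthetic division by (z - a), from the leading coefficient down.
        partial, carry = [], Fraction(0)
        for c in reversed(coeffs):
            carry = carry * a + c
            partial.append(carry)
        remainder = partial.pop()      # value of the polynomial at a
        if remainder != 0:
            break
        mult += 1
        coeffs = partial[::-1]         # quotient, lowest degree first
    return mult

def order_number(num, den, point):
    """Order number of num/den at a finite point, or at the point "inf"."""
    if point == "inf":
        # At infinity the order number is deg(den) - deg(num).
        return (len(den) - 1) - (len(num) - 1)
    return root_multiplicity(num, point) - root_multiplicity(den, point)

def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

# eta = z^2 (z - 1) / (z - 2) and zeta = (z - 2) / z
eta = ([0, 0, -1, 1], [-2, 1])
zeta = ([-2, 1], [0, 1])
product = (poly_mul(eta[0], zeta[0]), poly_mul(eta[1], zeta[1]))

points = [0, 1, 2, "inf"]
orders_eta = [order_number(*eta, p) for p in points]        # [2, 1, -1, -2]
orders_zeta = [order_number(*zeta, p) for p in points]      # [-1, 0, 1, 0]
orders_product = [order_number(*product, p) for p in points]
```

In this example the order numbers of the product are the termwise sums of those of the factors, as 5 asserts, and the order numbers of each function add up to zero over all points.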


6. We give the name polygons³⁹ to complexes of points, which may contain the
same point more than once, and denote them by $\mathfrak{A}, \mathfrak{B}, \mathfrak{C}, \ldots$.

We also let $\mathfrak{A}\mathfrak{B}$ denote the polygon obtained from the points of polygons $\mathfrak{A}$
and $\mathfrak{B}$ by letting a point $\mathfrak{P}$ that appears $r$-tuply in $\mathfrak{A}$ and $s$-tuply in $\mathfrak{B}$ appear
$(r + s)$-tuply in $\mathfrak{A}\mathfrak{B}$. The meaning of $\mathfrak{P}^r$ and $\mathfrak{A} = \mathfrak{P}^{r}\mathfrak{P}_1^{r_1}\mathfrak{P}_2^{r_2}\cdots$ is then immediate,
and the laws of divisibility of polygons agree completely with those for integers and
ideals. Points play the role of prime factors. Thus in order to obtain the unit as
well one must admit the polygon $\mathfrak{O}$ containing no points (the null-gon).

The number of points of a polygon is called its order. A polygon of order n is 
called an n-gon for short. 


39 Polygons are today called (positive) divisors. Dedekind and Weber do not consider arbitrary
divisors, which may include points with negative integer multiplicities. Instead they use quotients of
polygons (see §17). (Translator’s note.)




The greatest common divisor of two polygons $\mathfrak{A}, \mathfrak{B}$ is the polygon that contains
each point the least number of times it occurs in $\mathfrak{A}$ or $\mathfrak{B}$. If it is $\mathfrak{O}$, then $\mathfrak{A}, \mathfrak{B}$ are
called relatively prime.

The least common multiple of $\mathfrak{A}$ and $\mathfrak{B}$ is the polygon that contains each point
the greatest number of times it occurs in $\mathfrak{A}$ or $\mathfrak{B}$. If $\mathfrak{A}, \mathfrak{B}$ are relatively prime then
$\mathfrak{A}\mathfrak{B}$ is their least common multiple.

If $\mathfrak{A} = \mathfrak{P}^{r}\mathfrak{P}_1^{r_1}\mathfrak{P}_2^{r_2}\cdots$ is an arbitrary polygon, then there are always functions
$z$ in $\Omega$ that are infinite at none of the points of $\mathfrak{A}$. Because if $z$ is infinite at some
points of $\mathfrak{A}$, one can choose a constant $c$ so that $z - c$ has the value 0 at no point of
$\mathfrak{A}$, and then $\frac{1}{z-c}$ is finite at all points of the polygon $\mathfrak{A}$. If one begins with such a
variable, then the collection of all those integral algebraic functions of $z$ that vanish
at the points of the polygon $\mathfrak{A}$ (each counted according to its multiplicity) forms an
ideal $\mathfrak{a} = \mathfrak{p}^{r}\mathfrak{p}_1^{r_1}\mathfrak{p}_2^{r_2}\cdots$, and one can say that the polygon $\mathfrak{A}$ generates the ideal $\mathfrak{a}$, or
that $\mathfrak{A}$ is the null polygon of the ideal $\mathfrak{a}$. Here the ideal concept agrees completely
with the concept of a system of integral algebraic functions that all vanish at the
same fixed points. The ideal $\mathfrak{o}$ is generated by the null-gon $\mathfrak{O}$.

The product of two or more ideals is generated by the product of the null 
polygons of the factors, the greatest common divisor and least common multiple by 
the greatest common divisor and least common multiple of the corresponding null 
polygons. 
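
The arithmetic of polygons described in 6 is that of multisets of points. A minimal sketch, assuming points are represented by arbitrary labels and polygons by Python `Counter` objects (the function names are hypothetical, chosen only for this illustration):

```python
from collections import Counter

def product(a, b):
    """Product of polygons: multiplicities of each point are added."""
    return a + b

def gcd(a, b):
    """Greatest common divisor: least multiplicity of each point."""
    return a & b

def lcm(a, b):
    """Least common multiple: greatest multiplicity of each point."""
    return a | b

def order(a):
    """Number of points of the polygon, counted with multiplicity."""
    return sum(a.values())

def divides(a, b):
    """True if every point occurs in b at least as often as in a."""
    return all(b[p] >= k for p, k in a.items())

NULL_GON = Counter()                      # the polygon with no points

A = Counter({"P": 2, "Q": 1})             # P^2 Q
B = Counter({"P": 1, "R": 3})             # P R^3

AB = product(A, B)                        # P^3 Q R^3
D = gcd(A, B)                             # P
M = lcm(A, B)                             # P^2 Q R^3
relatively_prime = gcd(Counter({"Q": 1}), Counter({"R": 2})) == NULL_GON
```

As for integers and ideals, the product of the greatest common divisor and the least common multiple equals the product of the two polygons, and for relatively prime polygons the least common multiple is the product itself.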


7. Theorem. If $z$ is any variable in $\Omega$ and $n$ is the degree of the field $\Omega$ relative
to $z$, then $z$ takes each particular value $c$ at exactly $n$ points. Because if $\mathfrak{o}$ denotes
the system of all integral algebraic functions of $z$ and $c$ is a finite constant, then

$$\mathfrak{o}(z - c) = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\cdots, \qquad e_1 + e_2 + \cdots = n \quad (§9, 7),$$

where $\mathfrak{p}_1, \mathfrak{p}_2, \ldots$ are distinct prime ideals in $z$. If one lets $\mathfrak{P}_1, \mathfrak{P}_2, \ldots$ be the null
points of $\mathfrak{p}_1, \mathfrak{p}_2, \ldots$ then, by 2, $z$ has the value $c$ at $e_1$ points $\mathfrak{P}_1$ (or at $\mathfrak{P}_1^{e_1}$), at
$e_2$ points $\mathfrak{P}_2$ (or at $\mathfrak{P}_2^{e_2}$), etc., hence at the $n$ points of the polygon $\mathfrak{P}_1^{e_1}\mathfrak{P}_2^{e_2}\cdots$.
Conversely: if $\mathfrak{P}$ is a point at which $z$ has the value $c$, and if $\mathfrak{p}$ is the prime ideal in
$z$ generated by $\mathfrak{P}$, then $z \equiv c \pmod{\mathfrak{p}}$ and hence $\mathfrak{p}$ is one of the ideals $\mathfrak{p}_1, \mathfrak{p}_2, \ldots$, so
$\mathfrak{P}$ is one of the points $\mathfrak{P}_1, \mathfrak{P}_2, \ldots$. The same result also holds for $c = \infty$, because
$n$ is also the degree of $\Omega$ relative to $\frac{1}{z}$; hence the latter variable takes the value 0,
and so $z$ takes the value $\infty$, at exactly $n$ points. It follows from §11 that there are only a
finite number of values of the constant $c$ for which any of the exponents $e_1, e_2, \ldots$
can be greater than 1.

The number $n$, i.e., the number of points at which the function $z$ has the same
value, is called the order of the function $z$. The constant functions, and only these,
have order zero. For all other functions in $\Omega$ the order is a positive integer. The
order of a variable $z$ is at the same time the degree of the field $\Omega$ relative to $z$.


§16. Conjugate points and conjugate values 


SUMMARY AND COMMENTS 


After observing in §15 that there are $n$ points of the Riemann
surface where a function $z \in \Omega$ takes a given value $c \in \mathbb{C} \cup \{\infty\}$,
Dedekind and Weber now call these points conjugate points for $z$.
This recalls the concept of “conjugate” in algebraic number theory,
where each algebraic number $\alpha$ satisfies an equation whose roots
are called its conjugates, and whose norm $N(\alpha)$ is the product of
these conjugates.

Here, the norm $N(\eta)$ of a function $\eta$ with respect to the variable
$z$ (previously defined differently in §2) turns out to equal the
product of the conjugates of $\eta$.

The other development in this section is the unveiling of the
ramification points and their connection with the ramification ideal
of §11. The ramification points are those points at which an $n$-tuple
of conjugate points fails to consist of distinct points, these being the points where two or
more “sheets” of the Riemann surface come together. The ramification
ideal is generated by the ramification points at which the
independent variable $z$ has a finite value.

Another concept that will be important later (from §22 onward)
is the ramification number, which counts the ramification
points with their multiplicities. In this section the connection between ramification and
the discriminant, revealed in §11, is strengthened by relating the
ramification number to the degree of the discriminant.


1. Definition. If $c$ is a particular numerical value then, as shown in §15, there
is a corresponding polygon $\mathfrak{A}$ of $n$ (not necessarily distinct) points $\mathfrak{P}', \mathfrak{P}'', \ldots, \mathfrak{P}^{(n)}$
at which the $n$th order variable $z$ has this value. These $n$ points will be called
conjugate for $z$. Any one of them (together with $z$) determines the others. If one
lets $c$ continuously assume all possible values, then the polygon $\mathfrak{A} = \mathfrak{P}'\mathfrak{P}''\cdots\mathfrak{P}^{(n)}$
moves, and indeed in such a way that all its points change simultaneously. One
thereby obtains all existing points, with multiple points occurring (finitely often)
only where $z - z_0$ or $\frac{1}{z}$ vanishes to an order higher than first. Hence the product
of all these polygons is

$$\prod \mathfrak{A} = T\,\mathfrak{Z}_z,$$

where $T$, the collection of all points each counted once, is the Riemann surface, and $\mathfrak{Z}_z$ is a
particular finite polygon, called the ramification or winding polygon of $T$ in $z$.
Each point $\mathfrak{Q}$ in $\mathfrak{Z}_z$ is called a ramification or winding point of $T$ in $z$, of order $s$
when it appears $s$-tuply in $\mathfrak{Z}_z$. We have $s = e - 1$ when $z - z_0$ or $\frac{1}{z}$ is infinitely
small of $e$th order at $\mathfrak{Q}$. The order of the polygon $\mathfrak{Z}_z$ is called the ramification or
winding number $w_z$ of the surface $T$ for $z$. Those points of the ramification polygon
at which $z$ has a finite value together generate the ramification ideal in $z$ (§11).

If one wishes to pass from this definition of the “absolute” Riemann surface, 
which is an invariant concept of the field, to the well-known conception of Riemann, 
then one has to think of the surface spread over a z-plane, which is then covered 
n-tuply everywhere except at the ramification points. 
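
To make the definition concrete, here is a small worked example under an assumption not made in the text: take $\Omega = \mathbb{C}(z, y)$ with $y^2 = z$, so that $n = 2$. The symbols $\mathfrak{P}_0, \mathfrak{P}_\infty$ (the points over $z = 0$ and $z = \infty$) and $\mathfrak{o}'$ (the integral algebraic functions of $z' = 1/z$) are introduced only for this illustration.

```latex
\begin{align*}
c \neq 0, \infty &: & \mathfrak{o}(z - c) &= \mathfrak{p}'\mathfrak{p}'', &
  \mathfrak{A} &= \mathfrak{P}'\mathfrak{P}'' \quad (y = \pm\sqrt{c}),\\
c = 0 &: & \mathfrak{o}z &= \mathfrak{p}_0^{2}, &
  \mathfrak{A} &= \mathfrak{P}_0^{2} \quad (e = 2,\ s = 1),\\
c = \infty &: & \mathfrak{o}'z' &= \mathfrak{p}_\infty^{2}, &
  \mathfrak{A} &= \mathfrak{P}_\infty^{2} \quad (e = 2,\ s = 1),\\
 & & & & \mathfrak{Z}_z &= \mathfrak{P}_0\mathfrak{P}_\infty, \qquad w_z = 2.
\end{align*}
```

In this assumed example every value $c$ other than 0 and $\infty$ is taken at two distinct conjugate points, and the two sheets over the $z$-plane meet only over $z = 0$ and $z = \infty$.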

2. Theorem. If

$$z' = \frac{c + dz}{a + bz},$$

where $a, b, c, d$ are constants whose determinant $ad - bc$ is nonzero,⁴⁰ then

$$\mathfrak{Z}_z = \mathfrak{Z}_{z'}, \qquad w_z = w_{z'}.$$


40 Recall that this is the concept of linear transformation discussed by Dedekind and Weber in
§13, where it was shown that such transformations leave norm, trace, and discriminant invariant.
(Translator’s note.)
Because if $z - z_0$ or $\frac{1}{z}$ is infinitely small of $e$th order at a point $\mathfrak{P}$ then this is also
the case for

$$z' - z'_0 = \frac{(ad - bc)(z - z_0)}{(a + bz)(a + bz_0)},$$

or, when $z_0$ is infinite, for

$$z' - z'_0 = \frac{-(ad - bc)}{b(a + bz)},$$

or, when $z'_0 = \infty$ and hence $a + bz_0 = 0$, for

$$\frac{1}{z'} = \frac{a + bz}{c + dz}.$$

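In particular, if $z' = \frac{1}{z}$ then the ramification number $w_z = w_{z'}$ is equal to the
degree of the discriminant $\Delta_z(\Omega)$ plus the number of vanishing roots of $\Delta_{z'}(\Omega) = 0$
(§11): the degree counts the winding points at which $z$ is finite, and the vanishing roots count those at which $z = \infty$.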
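
As a hedged check of this count, assume the illustrative field $\Omega = \mathbb{C}(z, y)$ with $y^2 = z$ (an example not taken from the text). Here $1, y$ is a basis of the integral algebraic functions of $z$, and $1, y'$ with $y' = y/z$ is a basis of those of $z' = 1/z$; the only winding points lie over $z = 0$ and $z = \infty$, each of order 1, so $w_z = 2$.

```latex
\begin{align*}
\Delta_z(1, y) &=
  \begin{vmatrix} \mathrm{Tr}(1) & \mathrm{Tr}(y) \\ \mathrm{Tr}(y) & \mathrm{Tr}(y^2) \end{vmatrix}
  = \begin{vmatrix} 2 & 0 \\ 0 & 2z \end{vmatrix} = 4z, \\
\Delta_{z'}(1, y') &=
  \begin{vmatrix} 2 & 0 \\ 0 & 2z' \end{vmatrix} = 4z',
  \qquad y'^2 = \frac{y^2}{z^2} = z'.
\end{align*}
```

The discriminant $\Delta_z$ has degree 1, accounting for the winding point over $z = 0$, and $\Delta_{z'} = 0$ has one vanishing root, accounting for the winding point over $z = \infty$; together they give $1 + 1 = 2 = w_z$.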
3. Definition. The values $\eta', \eta'', \ldots, \eta^{(n)}$ taken by an arbitrary function $\eta$ in $\Omega$
at $n$ points $\mathfrak{P}', \mathfrak{P}'', \ldots, \mathfrak{P}^{(n)}$ that are conjugate for $z$ are called conjugate values of
$\eta$ for $z$.


4. Theorem. If $N_z(\eta)$ is the norm of an arbitrary function $\eta$ relative to $z$, then
the value that this rational function of $z$ has for $z = z_0$ is equal to the product
$\eta'\eta''\cdots\eta^{(n)}$ of the conjugate values of $\eta$ belonging to $z = z_0$, leaving aside the case
in which this product is indeterminate, i.e., where one of these conjugate values is
0 and another is $\infty$.

To prove this theorem we can assume that $z_0$ is finite; because if $z_0 = \infty$ we
replace $z$ by the variable $z' = \frac{1}{z}$, which leaves the norm unaltered. We can also
assume that the values $\eta', \eta'', \ldots, \eta^{(n)}$ are all finite; because if one of them is infinite
none of the others is zero, by hypothesis, and we consider the function $\frac{1}{\eta}$ instead of
$\eta$.

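Now, under these hypotheses, let

$$\mathfrak{o}(z - z_0) = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\mathfrak{p}_3^{e_3}\cdots$$

and let $\mathfrak{P}_1, \mathfrak{P}_2, \mathfrak{P}_3, \ldots$ be the null points of the distinct prime ideals $\mathfrak{p}_1, \mathfrak{p}_2, \mathfrak{p}_3, \ldots$.

We construct a system of integral algebraic functions $\lambda, \mu$ of $z$ by the following rule.
Let

$\lambda_1$ be divisible by $\mathfrak{p}_1$, not by $\mathfrak{p}_1^2$;
$\lambda_2$ be divisible by $\mathfrak{p}_2$, not by $\mathfrak{p}_2^2$;
$\lambda_3$ be divisible by $\mathfrak{p}_3$, not by $\mathfrak{p}_3^2$;
...

$\mu_1$ be divisible by $\mathfrak{p}_2^{e_2}, \mathfrak{p}_3^{e_3}, \ldots$, not by $\mathfrak{p}_1, \mathfrak{p}_2^{e_2+1}, \mathfrak{p}_3^{e_3+1}, \ldots$;
$\mu_2$ be divisible by $\mathfrak{p}_1^{e_1}, \mathfrak{p}_3^{e_3}, \ldots$, not by $\mathfrak{p}_2, \mathfrak{p}_1^{e_1+1}, \mathfrak{p}_3^{e_3+1}, \ldots$;
$\mu_3$ be divisible by $\mathfrak{p}_1^{e_1}, \mathfrak{p}_2^{e_2}, \ldots$, not by $\mathfrak{p}_3, \mathfrak{p}_1^{e_1+1}, \mathfrak{p}_2^{e_2+1}, \ldots$;⁴¹
...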


41 Such functions can be determined by §9, 3, footnote, or by §11, 2, where one can set, e.g.,
A= p—b, paA® = v(p). 
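The $n$ functions

$$\begin{array}{lllll}
\mu_1, & \mu_1\lambda_1, & \mu_1\lambda_1^2, & \ldots, & \mu_1\lambda_1^{e_1-1}, \\
\mu_2, & \mu_2\lambda_2, & \mu_2\lambda_2^2, & \ldots, & \mu_2\lambda_2^{e_2-1}, \\
\mu_3, & \mu_3\lambda_3, & \mu_3\lambda_3^2, & \ldots, & \mu_3\lambda_3^{e_3-1}, \\
\ldots
\end{array}$$

which we denote by $\eta_1, \eta_2, \ldots, \eta_n$, then form a basis of $\Omega$. This assertion is contained
in the more general assertion about to be proved.

If

$$(z - z_0)\zeta = x_1\eta_1 + x_2\eta_2 + \cdots + x_n\eta_n$$

with polynomial coefficients $x_1, x_2, \ldots, x_n$, and if $\zeta$ has finite values $\zeta', \zeta'', \zeta''', \ldots$ at
the points $\mathfrak{P}_1, \mathfrak{P}_2, \mathfrak{P}_3, \ldots$, then all the coefficients $x_1, x_2, \ldots, x_n$ must be divisible
by $z - z_0$. In fact, e.g., the left-hand side is infinitely small of order at least $e_1$ at
the point $\mathfrak{P}_1$. Hence by §15, 5

$$x_1\eta_1 + x_2\eta_2 + \cdots + x_{e_1}\eta_{e_1} = \mu_1\left(x_1 + x_2\lambda_1 + \cdots + x_{e_1}\lambda_1^{e_1-1}\right)$$

must also be infinitely small of this order. But this is impossible unless $x_1, x_2, \ldots, x_{e_1}$
vanish at $\mathfrak{P}_1$, and hence are divisible by $z - z_0$. Q.E.D.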
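It follows that we can set

$$\begin{aligned}
\eta\mu_1\lambda_1^{r} = {} & \mu_1\left(x_1^{(0)} + x_1^{(1)}\lambda_1 + \cdots + x_1^{(e_1-1)}\lambda_1^{e_1-1}\right) \\
 & + \mu_2\left(x_2^{(0)} + x_2^{(1)}\lambda_2 + \cdots + x_2^{(e_2-1)}\lambda_2^{e_2-1}\right) \\
 & + \mu_3\left(x_3^{(0)} + x_3^{(1)}\lambda_3 + \cdots + x_3^{(e_3-1)}\lambda_3^{e_3-1}\right) + \cdots,
\end{aligned}$$

where the $x_1^{(0)}, x_1^{(1)}, \ldots$ are rational functions of $z$ that all remain finite for
$z = z_0$. At the points $\mathfrak{P}_2, \mathfrak{P}_3, \ldots$ the left-hand side is infinitely small of order at
least $e_2, e_3, \ldots$. The same holds for $\mu_1, \mu_3, \ldots$ at $\mathfrak{P}_2$, but not for $\mu_2$, for $\mu_1, \mu_2, \ldots$
at $\mathfrak{P}_3$ but not for $\mu_3$, and so on. Consequently

$$\begin{aligned}
x_2^{(0)} &= 0, & x_2^{(1)} &= 0, & \ldots, & & x_2^{(e_2-1)} &= 0, \\
x_3^{(0)} &= 0, & x_3^{(1)} &= 0, & \ldots, & & x_3^{(e_3-1)} &= 0, \\
&\ldots
\end{aligned}$$

for $z = z_0$. At $\mathfrak{P}_1$ the left-hand side is infinitely small of order at least $r$. Therefore,
when $r < e_1$,

$$x_1^{(0)} = 0, \quad x_1^{(1)} = 0, \quad \ldots, \quad x_1^{(r-1)} = 0, \quad x_1^{(r)} = \eta'$$

for $z = z_0$. The same argument may be applied to the functions $\eta\mu_2\lambda_2^{r}, \eta\mu_3\lambda_3^{r}, \ldots$.
Thus if one sets

$$\begin{aligned}
\eta\eta_1 &= x_{1,1}\eta_1 + x_{1,2}\eta_2 + \cdots + x_{1,n}\eta_n, \\
\eta\eta_2 &= x_{2,1}\eta_1 + x_{2,2}\eta_2 + \cdots + x_{2,n}\eta_n, \\
&\ldots \\
\eta\eta_n &= x_{n,1}\eta_1 + x_{n,2}\eta_2 + \cdots + x_{n,n}\eta_n,
\end{aligned}$$

then all the terms to the left of the diagonal vanish in the determinant

$$N(\eta) = \sum \pm x_{1,1}x_{2,2}\cdots x_{n,n}$$

for $z = z_0$, while $e_1$ of the diagonal terms equal $\eta'$, $e_2$ of them equal $\eta''$, $e_3$ equal
$\eta'''$, and so on. Thus for $z = z_0$

$$N(\eta) = \eta'^{\,e_1}\eta''^{\,e_2}\eta'''^{\,e_3}\cdots,$$

Q.E.D.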

5. Since

$$\mathrm{Tr}(\eta) = x_{1,1} + x_{2,2} + \cdots + x_{n,n}$$

by the definition of trace, the same considerations lead to the theorem:

$$\mathrm{Tr}(\eta) = e_1\eta' + e_2\eta'' + e_3\eta''' + \cdots$$

for $z = z_0$, though only under the hypothesis that the values $\eta', \eta'', \eta''', \ldots$ are finite.
For an arbitrary constant (or rational function of $z$) $t$, Theorem 4 yields

$$N(t - \eta) = (t - \eta')^{e_1}(t - \eta'')^{e_2}(t - \eta''')^{e_3}\cdots$$

for $z = z_0$, and then comparing coefficients of equal powers of $t$ yields an expression
in terms of conjugate values (symmetric functions) for each of these coefficients.
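
A small worked instance may help here. It rests on an assumption not made in the text: the illustrative field $\Omega = \mathbb{C}(z, y)$ with $y^2 = z$ and basis $1, y$, and the function $\eta = 1 + y$. Over $z_0 \neq 0$ there are two distinct conjugate points, where $y = \pm\sqrt{z_0}$; over $z_0 = 0$ there is a single point with $e_1 = 2$, where $\eta' = 1$.

```latex
\begin{align*}
\eta \cdot 1 &= 1 \cdot 1 + 1 \cdot y, \qquad \eta \cdot y = z \cdot 1 + 1 \cdot y, \\
N(\eta) &= \begin{vmatrix} 1 & 1 \\ z & 1 \end{vmatrix} = 1 - z, \qquad
  \mathrm{Tr}(\eta) = 1 + 1 = 2, \\
z_0 \neq 0 &: \quad \eta'\eta'' = (1 + \sqrt{z_0})(1 - \sqrt{z_0}) = 1 - z_0,
  \qquad \eta' + \eta'' = 2, \\
z_0 = 0 &: \quad \eta'^{\,e_1} = 1^2 = 1 = N(\eta)\big|_{z=0},
  \qquad e_1\eta' = 2 \cdot 1 = 2.
\end{align*}
```

In both cases the values of the norm and the trace at $z = z_0$ agree with the product and the weighted sum of the conjugate values, as Theorems 4 and 5 assert.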


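6. If $\eta_1, \eta_2, \ldots, \eta_n$ is a basis for $\Omega$, then 5 immediately yields the value of the
discriminant of this system for $z = z_0$:

$$\Delta_z(\eta_1, \eta_2, \ldots, \eta_n) = \left(\sum \pm \eta_1'\eta_2''\cdots\eta_n^{(n)}\right)^{2},$$

where $\eta_\iota', \eta_\iota'', \ldots, \eta_\iota^{(n)}$ denote all the conjugate values of $\eta_\iota$ belonging to $z = z_0$, not
necessarily distinct but assumed finite.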


§17. Representing the functions in the field $\Omega$ by polygon quotients


SUMMARY AND COMMENTS 


A special case of the theorem in §15 about the values taken by
a function of order $n$ is that a function of order zero is constant. It
follows that each function $\eta$ is determined, up to a constant factor,
by the orders of its zeros and poles (or, as Dedekind and Weber say,
by its order numbers at each point). This is because, if $\eta'$ is any
other function with the same order numbers at each point, $\eta/\eta'$ is
of order zero and hence constant.

Thus Dedekind and Weber recognize that both positive and
negative order numbers are needed to determine a function $\eta$, and
so they introduce polygon quotients $\mathfrak{A}/\mathfrak{B}$. The upper polygon $\mathfrak{A}$
encodes the zeros of $\eta$, and the lower polygon $\mathfrak{B}$ encodes the poles
of $\eta$.


A function $\eta$ in the field $\Omega$ has a nonzero order number at only a finite number
of points. The sum of all these order numbers is 0, and hence the sum of the
positive order numbers equals the sum of the negative order numbers (taken in absolute value), and indeed
it equals the order of the function $\eta$ (§15). If the order numbers of a function $\eta$ are
known for each point $\mathfrak{P}$, then the function $\eta$ is determined up to a constant factor.
Because if $\eta'$ has everywhere the same order number as $\eta$ then $\frac{\eta}{\eta'}$ (by §15, 5) has
order number zero everywhere and hence (by §15, 7) is a constant.

Thus if we construct a polygon $\mathfrak{A}$ in which each point where $\eta$ has a positive
order number is taken this order number of times, and a second polygon $\mathfrak{B}$ in
which we similarly take the points at which $\eta$ has a negative order number, then
the polygons $\mathfrak{A}, \mathfrak{B}$ are of the same order, and indeed it is the order of the function
$\eta$. Thus the function $\eta$ is determined up to a constant factor by these polygons
$\mathfrak{A}, \mathfrak{B}$. We symbolically set

$$\eta = \frac{\mathfrak{A}}{\mathfrak{B}}$$

and call $\mathfrak{A}$ the upper polygon, and $\mathfrak{B}$ the lower polygon, of the function $\eta$.⁴²

The way we have set things up, the two polygons $\mathfrak{A}, \mathfrak{B}$ are relatively prime.
However, it is convenient to extend the notation so that one may also admit common
factors in $\mathfrak{A}, \mathfrak{B}$, which is done by setting

$$\frac{\mathfrak{M}\mathfrak{A}}{\mathfrak{M}\mathfrak{B}} = \frac{\mathfrak{A}}{\mathfrak{B}},$$

where $\mathfrak{M}$ denotes an arbitrary polygon. If

$$\eta = \frac{\mathfrak{A}}{\mathfrak{B}}$$

in this generalised notation, then a point $\mathfrak{P}$ at which $\eta$ has order number $m$ will
occur $m_1$-tuply in $\mathfrak{A}$ and $m_2$-tuply in $\mathfrak{B}$, where $m_1 - m_2 = m$. It is still true that
the order of $\mathfrak{A}$ equals the order of $\mathfrak{B}$, but they no longer equal the order of the
function $\eta$.

This definition (and §15, 5) yields the following theorem immediately: if

$$\eta = \frac{\mathfrak{A}}{\mathfrak{B}}, \qquad \eta' = \frac{\mathfrak{A}'}{\mathfrak{B}'},$$

then

$$\eta\eta' = \frac{\mathfrak{A}\mathfrak{A}'}{\mathfrak{B}\mathfrak{B}'}, \qquad \frac{\eta}{\eta'} = \frac{\mathfrak{A}\mathfrak{B}'}{\mathfrak{B}\mathfrak{A}'}.$$

By §14, 5 a function $\eta'$ is an integral algebraic function of $\eta$ if and only if each
point in the lower polygon of $\eta'$ also appears in the lower polygon of $\eta$.
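
The passage from order numbers to a polygon quotient, and the multiplication rule, can be sketched in a few lines. This is only an illustration under stated assumptions: a function is given by a table of its nonzero order numbers (here for the rational functions $z^2(z-1)/(z-2)$ and $(z-2)/z$ in the assumed field $\mathbb{C}(z)$, with points labelled by the values of $z$), polygons are Python `Counter` multisets, and the names `polygon_quotient`, `multiply` and `reduce_quotient` are hypothetical.

```python
from collections import Counter

def polygon_quotient(orders):
    """Upper and lower polygons from a table {point: order number}."""
    upper = Counter({p: m for p, m in orders.items() if m > 0})
    lower = Counter({p: -m for p, m in orders.items() if m < 0})
    return upper, lower

def multiply(q1, q2):
    """Product rule: upper polygons multiply, lower polygons multiply."""
    (a1, b1), (a2, b2) = q1, q2
    return a1 + a2, b1 + b2

def reduce_quotient(q):
    """Cancel the greatest common divisor of upper and lower polygon."""
    a, b = q
    common = a & b
    return a - common, b - common

def order(polygon):
    """Number of points, counted with multiplicity."""
    return sum(polygon.values())

# eta = z^2 (z - 1) / (z - 2): order numbers 2, 1, -1, -2 at 0, 1, 2, infinity
eta = polygon_quotient({0: 2, 1: 1, 2: -1, "inf": -2})
# zeta = (z - 2) / z: order numbers -1, 1 at 0, 2
zeta = polygon_quotient({0: -1, 2: 1})

unreduced = multiply(eta, zeta)          # common factors are admitted here
reduced = reduce_quotient(unreduced)     # relatively prime upper and lower polygon
```

The unreduced quotient has upper and lower polygons of equal order 4, but only the reduced one, of order 2, gives the order of the product $z(z-1)$.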


§18. Equivalent polygons and polygon classes 


SUMMARY AND COMMENTS 


Not all pairs $\mathfrak{A}, \mathfrak{B}$ can occur in an equation $\eta = \frac{\mathfrak{A}}{\mathfrak{B}}$, and Weber
(1908), §191, remarks: “Investigating this relationship is the great 
question that, in another way, is answered by Abel’s theorem.” 

The present section begins the march towards Abel’s theorem 
by defining equivalence classes of polygons, the product of equiva- 
lence classes, and divisibility of equivalence classes. 


42 All functions in the simple vector space $(\eta)$ therefore have the same symbol $\frac{\mathfrak{A}}{\mathfrak{B}}$, and it
would therefore be more correct to set $(\eta) = \frac{\mathfrak{A}}{\mathfrak{B}}$. However, the latter notation leads to unnecessary
longwindedness.
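1. Definition. Two polygons $\mathfrak{A}, \mathfrak{A}'$ with the same number of points are called
equivalent⁴³ when there is a function $\eta$ in $\Omega$ with the notation (§17)

$$\eta = \frac{\mathfrak{A}}{\mathfrak{A}'}.$$


2. Theorem. If $\mathfrak{A}$ is equivalent to $\mathfrak{A}'$ and $\mathfrak{A}''$, then $\mathfrak{A}'$ is also equivalent to $\mathfrak{A}''$.
Because

$$\eta = \frac{\mathfrak{A}}{\mathfrak{A}'}, \qquad \eta' = \frac{\mathfrak{A}}{\mathfrak{A}''}$$

imply

$$\frac{\eta'}{\eta} = \frac{\mathfrak{A}'}{\mathfrak{A}''}.$$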


3. Definition and Theorem. All the polygons $\mathfrak{A}', \mathfrak{A}'', \ldots$ equivalent to a given
polygon $\mathfrak{A}$ constitute a polygon class $A$. By 2, each polygon belongs to one and
only one class, because if $\mathfrak{A}, \mathfrak{B}$ are two equivalent polygons that lead to the classes
$A, B$, then, by 2, each polygon in $B$ also belongs to $A$ and conversely, hence the
two classes are identical.

All polygons in a class have the same order, which will be called the order of 
the class. 


4. Nevertheless there exist polygons that are equivalent to no others, so each 
of them forms a class by itself. Such polygons may be called isolated. 


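5. If $\mathfrak{M}$ is an arbitrary polygon, and $\mathfrak{A}$ is equivalent to $\mathfrak{A}'$, then $\mathfrak{M}\mathfrak{A}$ is also
equivalent to $\mathfrak{M}\mathfrak{A}'$. Conversely, the equivalence of $\mathfrak{M}\mathfrak{A}$ and $\mathfrak{M}\mathfrak{A}'$ implies the
equivalence of $\mathfrak{A}$ and $\mathfrak{A}'$.


6. If $\mathfrak{A}$ is equivalent to $\mathfrak{A}'$, and $\mathfrak{B}$ to $\mathfrak{B}'$, then $\mathfrak{A}\mathfrak{B}$ is also equivalent to $\mathfrak{A}'\mathfrak{B}'$. The
class $C$ containing the product $\mathfrak{A}\mathfrak{B}$ therefore contains all products of two polygons
respectively from the classes $A, B$ of $\mathfrak{A}, \mathfrak{B}$ (as well as infinitely many other polygons
under some circumstances), and we shall call it the product of the two classes $A, B$:

$$C = AB = BA.$$

The definition of the product of several classes, and the validity of the fundamental
multiplication theorem,⁴⁴ are immediate.

7. If $A, B, D$ are three classes that satisfy the condition

$$DA = DB,$$

then $A = B$ follows. Because if $\mathfrak{A}, \mathfrak{B}, \mathfrak{D}$ are three polygons in the classes $A, B, D$,
then it follows from the hypothesis that $\mathfrak{D}\mathfrak{B}$ is equivalent to $\mathfrak{D}\mathfrak{A}$, and hence $\mathfrak{B}$ is
equivalent to $\mathfrak{A}$.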


8. If a polygon $\mathfrak{A}$ in class $A$ divides a polygon $\mathfrak{C}$ in class $C$, then the same is
true of each polygon $\mathfrak{A}'$ in class $A$. Because it follows from $\mathfrak{C} = \mathfrak{A}\mathfrak{B}$, by 5, that
$\mathfrak{C}' = \mathfrak{A}'\mathfrak{B}$ is in $C$. We can therefore say, regardless of whether each polygon in class


43 Today, this equivalence relation is called linear equivalence, and its equivalence classes are
called linear systems. (Translator’s note.)
44 That is, associativity. (Translator’s note.)




$C$ is divisible by a polygon in class $A$, that the class $C$ is divisible by the class $A$.
If $\mathfrak{B}'$ is any polygon in the class $B$ of $\mathfrak{B}$, then $\mathfrak{C}'' = \mathfrak{A}'\mathfrak{B}'$ is also in $C$ and hence

$$C = AB.$$

Thus if $C$ is divisible by $A$ then there is one and (by 7) only one class $B$ that
satisfies the condition

$$C = AB.$$


§19. Vector spaces of polygons 


SUMMARY AND COMMENTS 


This section investigates vector spaces of polygons, bringing 
the concepts of linear independence, basis, and dimension into play. 
This leads, in §21, to the result that each equivalence class of poly- 
gons is a finite-dimensional vector space. 


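1. If $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ are particular equivalent polygons, and if $\mathfrak{A}$ is an arbitrary
polygon in the same class $A$, then there are $s$ functions in $\Omega$,

$$\eta_1 = \frac{\mathfrak{A}_1}{\mathfrak{A}}, \quad \eta_2 = \frac{\mathfrak{A}_2}{\mathfrak{A}}, \quad \ldots, \quad \eta_s = \frac{\mathfrak{A}_s}{\mathfrak{A}}.$$

If, at an arbitrary point $\mathfrak{P}$, one sets

$$\begin{aligned}
\eta_1 &= e_1\rho^{m} + \sigma_1\rho^{m+1}, \\
&\ldots \\
\eta_s &= e_s\rho^{m} + \sigma_s\rho^{m+1},
\end{aligned}$$

as in §15, 5, where $\rho$ is $0^1$ at $\mathfrak{P}$, $e_1, e_2, \ldots, e_s$ are constants, not all zero, and
$\sigma_1, \sigma_2, \ldots, \sigma_s$ are functions finite at $\mathfrak{P}$, then it follows that each function $\eta$ in the
vector space $(\eta_1, \eta_2, \ldots, \eta_s)$, i.e., each function of the form

$$\eta = c_1\eta_1 + c_2\eta_2 + \cdots + c_s\eta_s,$$

has an order number at $\mathfrak{P}$ no smaller than $m$, and hence, by §17, that the function
$\eta$ can be expressed in the form

$$\eta = \frac{\mathfrak{A}'}{\mathfrak{A}},$$

where $\mathfrak{A}'$ likewise belongs to the class $A$.

If one replaces $\mathfrak{A}$ by any other polygon $\mathfrak{B}$ in the class $A$, and sets

$$\zeta = \frac{\mathfrak{A}}{\mathfrak{B}}, \qquad
\eta_1' = \frac{\mathfrak{A}_1}{\mathfrak{B}} = \zeta\eta_1, \quad
\eta_2' = \frac{\mathfrak{A}_2}{\mathfrak{B}} = \zeta\eta_2, \quad \ldots, \quad
\eta_s' = \frac{\mathfrak{A}_s}{\mathfrak{B}} = \zeta\eta_s,$$

then we also have

$$\eta\zeta = \eta' = c_1\eta_1' + c_2\eta_2' + \cdots + c_s\eta_s',$$

and hence

$$\eta' = \frac{\mathfrak{A}'}{\mathfrak{B}}.$$

Each polygon $\mathfrak{A}'$ generated by the denominator $\mathfrak{A}$ and the system $c_1, c_2, \ldots, c_s$
of constants is therefore also generated by any other denominator $\mathfrak{B}$ belonging
to the same class, and the collection of all polygons $\mathfrak{A}'$ corresponding to the
different values of the constants $c_1, c_2, \ldots, c_s$ is dependent only on the polygons
$\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$. This collection will therefore be called a vector space of polygons
with basis $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$, and will be denoted by

$$(\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s).$$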


2. If the polygons $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ have a greatest common divisor $\mathfrak{M}$, then the
latter is also a divisor of each polygon $\mathfrak{A}'$ in the vector space $(\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s)$, by 1,
and it may be called a divisor⁴⁵ of the vector space. However, it is possible to choose
a polygon $\mathfrak{A}' = \mathfrak{M}\mathfrak{B}$ in this system so that $\mathfrak{B}$ is relatively prime to an arbitrary
polygon. To show this, we retain the notation of 1 and suppose a point $\mathfrak{P}$ occurs
$\mu$-tuply in $\mathfrak{M}$ and $\nu$-tuply in $\mathfrak{A}$. Then, if we set

$$\eta = e\rho^{m} + \sigma\rho^{m+1},$$

$m$ is never less than $\mu - \nu$, and $m = \mu - \nu$ when one chooses the constants
$c_1, c_2, \ldots, c_s$ so that

$$e = c_1e_1 + c_2e_2 + \cdots + c_se_s$$

is nonzero. The point $\mathfrak{P}$ therefore occurs at least $\mu$-tuply in $\mathfrak{A}'$ and, by the last
hypothesis, no more than $\mu$-tuply. Now since one can always choose the constants
$c_1, c_2, \ldots, c_s$ so that an arbitrary number of expressions of the form

$$\sum c_\iota e_\iota, \quad \sum c_\iota e_\iota', \quad \ldots,$$

in which not all the constants $e_\iota$, not all the $e_\iota'$, and so on vanish, themselves have nonzero values, the
correctness of the assertion follows.


3. The functions $\eta_1, \eta_2, \ldots, \eta_s$ in 1 are linearly independent if and only if the
same is true of the functions $\eta_1', \eta_2', \ldots, \eta_s'$. We can therefore speak of the polygons
$\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ being linearly dependent or independent, and of their being a linearly
reducible or irreducible system.

Since by §5, 4 each vector space of functions has an irreducible basis, it follows 
that each vector space of polygons has an irreducible basis. If s is the number of 
polygons in such a basis, then the vector space is said to be s-tuple, or of dimension 
s. Any s polygons in such a vector space constitute an irreducible basis if and only 
if they are independent (cf. §5, 4). 


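4. If $\mathfrak{M}$ is an arbitrary polygon, then the polygons $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ are linearly
independent if and only if $\mathfrak{M}\mathfrak{A}_1, \mathfrak{M}\mathfrak{A}_2, \ldots, \mathfrak{M}\mathfrak{A}_s$ are linearly independent.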


§20. Lowering the dimension of the space by divisibility conditions 


SUMMARY AND COMMENTS 


By relating dimension to divisibility, Dedekind and Weber pave 
the way for bounding the dimension of an equivalence class of poly- 
gons in §21. 


45 Today called a base locus, while for us a “divisor” of the space is simply a member. (Translator’s note.)
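1. Let

$$S = (\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s)$$

be an $s$-tuple vector space with divisor $\mathfrak{M}$. We consider the manifold of those
polygons $\mathfrak{A}'$ in the system $S$ that contain a given point $\mathfrak{P}$ at least once more often
than the divisor $\mathfrak{M}$ of the system.

If point $\mathfrak{P}$ is contained $\mu$-tuply in $\mathfrak{M}$ and $\nu$-tuply in a polygon $\mathfrak{A}$ equivalent to
$\mathfrak{A}_1, \mathfrak{A}_2, \ldots$, then, when we set

$$\begin{aligned}
\frac{\mathfrak{A}_1}{\mathfrak{A}} &= \eta_1 = e_1\rho^{m} + \sigma_1\rho^{m+1}, \\
\frac{\mathfrak{A}_2}{\mathfrak{A}} &= \eta_2 = e_2\rho^{m} + \sigma_2\rho^{m+1}, \\
&\ldots \\
\frac{\mathfrak{A}_s}{\mathfrak{A}} &= \eta_s = e_s\rho^{m} + \sigma_s\rho^{m+1},
\end{aligned}$$

$m = \mu - \nu$ and at least one of the constants $e_1, e_2, \ldots, e_s$, say $e_s$, is nonzero. The
desired polygon $\mathfrak{A}'$ is then characterised by the equation

$$\frac{\mathfrak{A}'}{\mathfrak{A}} = \eta = c_1\eta_1 + c_2\eta_2 + \cdots + c_s\eta_s,$$

where the constants $c_1, c_2, \ldots, c_s$ are related by the condition

$$c_1e_1 + c_2e_2 + \cdots + c_se_s = 0.$$

It follows that we can set

$$e_s\frac{\mathfrak{A}'}{\mathfrak{A}} = c_1(e_s\eta_1 - e_1\eta_s) + \cdots + c_{s-1}(e_s\eta_{s-1} - e_{s-1}\eta_s).$$

But then, when we set

$$\begin{aligned}
\eta_1' &= e_s\eta_1 - e_1\eta_s, \\
\eta_2' &= e_s\eta_2 - e_2\eta_s, \\
&\ldots \\
\eta_{s-1}' &= e_s\eta_{s-1} - e_{s-1}\eta_s,
\end{aligned}$$

we see that the functions $\eta$ constitute an $(s-1)$-tuple vector space $(\eta_1', \eta_2', \ldots, \eta_{s-1}')$;
because the functions $\eta_1', \eta_2', \ldots, \eta_{s-1}'$ are linearly independent when, as assumed
here, the functions $\eta_1, \eta_2, \ldots, \eta_s$ are. Thus the polygons $\mathfrak{A}'$ also form an $(s-1)$-tuple
vector space

$$S' = (\mathfrak{A}_1', \mathfrak{A}_2', \ldots, \mathfrak{A}_{s-1}')$$

when

$$\frac{\mathfrak{A}_\iota'}{\mathfrak{A}} = e_s\eta_\iota - e_\iota\eta_s.$$

The divisor of this vector space is divisible by $\mathfrak{M}\mathfrak{P}$, although not necessarily identical
with it.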
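
The elimination step $\eta_\iota' = e_s\eta_\iota - e_\iota\eta_s$ can be tried out numerically. The sketch below assumes the field $\mathbb{C}(z)$ and the space spanned by $\eta_1 = 1$, $\eta_2 = z$, $\eta_3 = z^2$, with $\mathfrak{P}$ the point $z = a$ for a finite $a$. In this assumed case $m = 0$ at $\mathfrak{P}$, so the constants $e_\iota$ are simply the values $\eta_\iota(a)$. The names `evaluate` and `reduce_at_point` are hypothetical; polynomials are coefficient lists, lowest degree first.

```python
from fractions import Fraction

def evaluate(coeffs, a):
    """Value at a of the polynomial sum(coeffs[k] * z**k), by Horner's rule."""
    value = Fraction(0)
    for c in reversed(coeffs):
        value = value * a + c
    return value

def reduce_at_point(basis, a):
    """Return the s - 1 combinations e_s*eta_i - e_i*eta_s, which vanish at a.

    Assumes the last basis function is nonzero at a (otherwise reorder the basis).
    """
    values = [evaluate(f, a) for f in basis]      # the constants e_1, ..., e_s
    e_s, eta_s = values[-1], basis[-1]
    if e_s == 0:
        raise ValueError("reorder the basis so that the last function is nonzero at a")
    width = max(len(f) for f in basis)

    def padded(f):
        return list(f) + [0] * (width - len(f))

    return [
        [e_s * x - e_i * y for x, y in zip(padded(eta_i), padded(eta_s))]
        for eta_i, e_i in zip(basis[:-1], values[:-1])
    ]

basis = [[1], [0, 1], [0, 0, 1]]        # 1, z, z^2
reduced = reduce_at_point(basis, 2)     # 4 - z^2 and 4z - 2z^2
```

Both resulting functions vanish at $z = 2$, and there are $s - 1 = 2$ of them, in line with the drop in dimension by one.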


2. It follows immediately that the polygons of a vector space $S$ that are divisible
by an arbitrary $r$-gon $\mathfrak{R}$ form a vector space that is at least $(s-r)$-tuple. Because if
we assume this already proved for an $r$-gon $\mathfrak{R}$, then the correctness of the assertion
for an $(r+1)$-gon $\mathfrak{R}\mathfrak{P}$ follows immediately from 1, since the appearance of the point
$\mathfrak{P}$, when $\mathfrak{R}\mathfrak{P}$ is in the divisor of the vector space already reduced by $\mathfrak{R}$, cannot affect
the dimension except to lower it by 1.

It follows in particular that an $s$-tuple vector space always contains at least one
polygon divisible by a given $(s - 1)$-gon.


3. When $r \le s$ one can choose the $r$-gon $\mathfrak{R}$ in such a way that the polygons
of the vector space $S$ that are divisible by $\mathfrak{R}$ form a precisely $(s - r)$-tuple vector
space. To do this one chooses a point $\mathfrak{P}$ not contained in the divisor of $S$. The
polygons in $S$ divisible by $\mathfrak{P}$ form an $(s-1)$-tuple vector space $S'$ by 1. One chooses
a second point $\mathfrak{P}'$ not in the divisor of $S'$. The polygons in $S'$ divisible by $\mathfrak{P}'$, i.e.,
the polygons in $S$ divisible by $\mathfrak{P}\mathfrak{P}'$, form an $(s - 2)$-tuple vector space, etc. At
the same time, it becomes clear from this construction that one can take $\mathfrak{R}$ to be
relatively prime to an arbitrary given polygon. If $r = s$ this means that there is no
polygon in $S$ that is divisible by $\mathfrak{R}$.


§21. The dimensions of polygon classes 


SUMMARY AND COMMENTS 


The study of equivalent polygons, begun in §18, now culminates 
in the result that a polygon equivalence class is finite-dimensional. 
The finite dimensionality makes it possible to define the concept of 
a proper⁴⁶ class; namely, a class is proper if the greatest common
divisor of its basis elements is the empty polygon. This concept 
turns out to be useful in the proof of the Riemann-Roch theorem. 
The first, and most substantial, case to be proved is that for proper 
classes, in §28. 


1. The polygons in a class form a vector space of finite dimension, called the 
dimension of the class. 

Proof. If one chooses $s$ polygons $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ in a class $A$ of order $m$, then all
polygons in the vector space $(\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s)$ likewise belong to the class $A$. The
number of linearly independent polygons in $A$ can therefore not be greater than
$m + 1$, otherwise (by §20, 2) one could find a polygon in the class divisible by an
arbitrary $(m+1)$-gon, which is absurd. Thus if $s$ is the maximal number of linearly
independent polygons $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ in the class $A$ then each polygon in the class
must belong to the vector space $(\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s)$, and $s$ is the dimension of the
class. The system $\mathfrak{A}_1, \mathfrak{A}_2, \ldots, \mathfrak{A}_s$ of polygons will be called a basis of the class.

The isolated polygons form classes of dimension 1. 
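
The bound $m + 1$ is attained in the simplest case. As an illustration only, assume $\Omega = \mathbb{C}(z)$, write $\mathfrak{P}_0$ and $\mathfrak{P}_\infty$ for the points where $z = 0$ and $z = \infty$ (notation introduced here, not in the text), and take the class $A$ of the polygon $\mathfrak{A} = \mathfrak{P}_\infty^{m}$:

```latex
\begin{align*}
\eta_k &= z^{k}
  = \frac{\mathfrak{P}_0^{k}\,\mathfrak{P}_\infty^{m-k}}{\mathfrak{P}_\infty^{m}}
  = \frac{\mathfrak{A}_k}{\mathfrak{A}}, \qquad k = 0, 1, \ldots, m, \\
\eta &= c_0 + c_1 z + \cdots + c_m z^{m} = \frac{\mathfrak{A}'}{\mathfrak{P}_\infty^{m}},
\end{align*}
```

where, if the polynomial has degree $d$, the polygon $\mathfrak{A}'$ consists of its $d$ roots together with $\mathfrak{P}_\infty^{m-d}$. In this assumed case every polygon of order $m$ arises in this way, and the $m + 1$ functions $1, z, \ldots, z^m$ are linearly independent, so the class of order $m$ has dimension exactly $m + 1$.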


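2. If a class $C$ contains $s$ and no more linearly independent polygons divisible
by a given polygon $\mathfrak{A}$ in the class $A$,

$$\mathfrak{C}_1 = \mathfrak{A}\mathfrak{B}_1, \quad \mathfrak{C}_2 = \mathfrak{A}\mathfrak{B}_2, \quad \ldots, \quad \mathfrak{C}_s = \mathfrak{A}\mathfrak{B}_s,$$

then $C$ is divisible by $A$, and $C$ also contains the same number of independent
polygons

$$\mathfrak{C}_1' = \mathfrak{A}'\mathfrak{B}_1, \quad \mathfrak{C}_2' = \mathfrak{A}'\mathfrak{B}_2, \quad \ldots, \quad \mathfrak{C}_s' = \mathfrak{A}'\mathfrak{B}_s,$$

divisible by an arbitrary polygon $\mathfrak{A}'$ equivalent to $\mathfrak{A}$ (§18, 8; §19, 4). The number
$s$ therefore depends only on the classes $A, C$ and can be conveniently denoted by
$(A, C)$. The value of the symbol $(A, C)$ is set equal to 0 when $C$ is not divisible by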


46 Today, a proper class is called a linear system without base points. (Translator’s note.)
$A$. It follows that the dimension of a class $A$ is $(O, A)$, where $O$ denotes the class
consisting of the null-gon $\mathfrak{O}$. If (§18, 8)

$$C = AB,$$

then it follows that

$$(1) \qquad (A, C) = (A, AB) = (O, B),$$

because the polygons $\mathfrak{B}_1, \mathfrak{B}_2, \ldots, \mathfrak{B}_s$, which all belong to $B$, are linearly independent,
hence $(O, B)$ is certainly not smaller than $s$. Conversely, if $\mathfrak{B}$ is an
arbitrary polygon in class $B$, then $\mathfrak{A}\mathfrak{B}$ is in $C$, hence also in the vector space
$(\mathfrak{A}\mathfrak{B}_1, \mathfrak{A}\mathfrak{B}_2, \ldots, \mathfrak{A}\mathfrak{B}_s)$, therefore $\mathfrak{B}$ is in the vector space $(\mathfrak{B}_1, \mathfrak{B}_2, \ldots, \mathfrak{B}_s)$, i.e.,
$(O, B) = s$.

If $a$ is the order of the class $A$, then $(A, C) \ge (O, C) - a$, by §20, 2, and then
by (1) we get the general theorem

$$(2) \qquad (O, B) \ge (O, AB) - a.$$


3. If all the basis polygons of a class $A$ have the greatest common divisor $\mathfrak{M}$,
then the latter is a divisor of all the polygons in class $A$. If $\mathfrak{M}$ equals the null-gon
$\mathfrak{O}$, then the class is called proper, otherwise improper with divisor $\mathfrak{M}$.

If one divides all polygons in an improper class $A$ by the divisor $\mathfrak{M}$, then one
obtains a proper class $A'$ of lower order, but of the same dimension. The relation
between $A$ and $A'$ will be expressed symbolically by

$$A = \mathfrak{M}A'.$$

4. The divisor $\mathfrak{M}$ of an improper class $A$ is always an isolated polygon. If

$$A = \mathfrak{M}A',$$

then by §19, 2 one can choose a polygon $\mathfrak{A}'$ in the proper class $A'$ that is relatively
prime to $\mathfrak{M}$. Thus if $\mathfrak{M}'$ is equivalent to $\mathfrak{M}$, then $\mathfrak{M}'\mathfrak{A}'$ is equivalent to $\mathfrak{M}\mathfrak{A}'$, and
hence belongs to $A$, which means it is divisible by $\mathfrak{M}$. Therefore $\mathfrak{M}'$ is also divisible
by $\mathfrak{M}$, and since $\mathfrak{M}$ and $\mathfrak{M}'$ are of the same order,

$$\mathfrak{M} = \mathfrak{M}'.$$

Consequently the single polygon $\mathfrak{M}$ constitutes a class $M$, and the symbol $\mathfrak{M}A'$
means the same as $MA'$ (§18, 6).


§22. The normal bases of o 


SUMMARY AND COMMENTS 


A normal basis is one that suits not only the integral closure of 
C[z], but also the integral closure of C[z’], where z’ = 1/z, thus 
allowing equal treatment of ordinary points and points at infinity. 
The construction of normal bases is the technical heart of Dedekind 
and Weber's work, and it was reworked or rediscovered by many 
later mathematicians, among them Hensel and Landsberg, Birkhoff, 
Hasse, and Grothendieck. See Geyer (1981) for an account of these 
later developments and a modern formulation of the result in terms 
of vector bundles. 

Geyer remarks (p. 126) that the later proofs have “overall the
same approach as Dedekind and Weber.” Since Grothendieck is
famous for doing things his own way, this is strong evidence for the
“inner necessity” alluded to by Dedekind in his letter to Weber that
was quoted in Section 12 of the Translator’s Introduction.

Some of the technical results on bases of $\mathfrak{o}$, harking back to
results in §2 and §3, throw light on the ramification number $w_z$
introduced in §16. Because of the relation between the ramification
number and the degree of the discriminant, found in §16, the
ramification number is even. This is important in §24, where half
of the ramification number occurs in the definition of genus.


1. In what follows we consider the system $\mathfrak{o}$ of integral algebraic functions $\omega$ of
an arbitrary variable $z$ in $\Omega$ and at the same time the system $\mathfrak{o}'$ of integral algebraic
functions $\omega'$ of $z' = \frac{1}{z}$. It is immediately clear from the definition of integral
algebraic function that the two systems $\mathfrak{o}, \mathfrak{o}'$ have only the constants in common,
but that each function $\omega$ can be converted into a function $\omega'$ by multiplying it by
a particular positive power of $z'$. If $\omega z'^{r}$ is in $\mathfrak{o}'$, then so too are $\omega z'^{r+1}, \omega z'^{r+2}, \ldots$.
Thus in the series of functions

$$\omega, \quad \omega z' = \frac{\omega}{z}, \quad \omega z'^{2} = \frac{\omega}{z^{2}}, \quad \ldots$$

there is a certain term $\omega z'^{r}$ that belongs to $\mathfrak{o}'$ along with all its successors, while
all the preceding terms do not. The smallest number $r$ for which $z'^{r}\omega$ belongs to
$\mathfrak{o}'$ will be called the exponent of the function $\omega$ with respect to $z$. The constants,
and only these, have exponent zero. If $\omega$ is nonzero, and if $r$ is its exponent, then
$r + 1$ is the exponent of $(z - c)\omega$. Because if $\omega = z^{r}\omega'$, then

$$\frac{(z - c)\omega}{z^{r+1}} = (1 - cz')\omega' \ \text{ is in } \mathfrak{o}', \qquad
\frac{(z - c)\omega}{z^{r}} = z\omega' - c\omega' \ \text{ is not in } \mathfrak{o}',$$

since $c\omega'$ is in $\mathfrak{o}'$ but $z\omega' = \frac{\omega}{z^{r-1}}$ is not. It follows from this that, in general:

If $x$ is a polynomial in $z$ of degree $s$, and if $r$ is the exponent of $\omega$, then $(r + s)$
is the exponent of $x\omega$.


2. We now choose a function system $\lambda_1, \lambda_2, \ldots, \lambda_n$ in $\mathfrak{o}$ by the following rule:

Let $\lambda_1$ be a nonzero constant, e.g., 1. Let $\lambda_2$ be among the functions in $\mathfrak{o}$,
not congruent to a constant modulo $\mathfrak{o}z$, that have smallest possible exponent $r_2$,
etc. In general, let $\lambda_s$ be among the functions in $\mathfrak{o}$, not congruent to a function in
the vector space $(\lambda_1, \lambda_2, \ldots, \lambda_{s-1})$ (mod $\mathfrak{o}z$), and of smallest possible exponent $r_s$.
Since $(\mathfrak{o}, \mathfrak{o}z) = N(z) = z^{n}$ is of $n$th degree, there are exactly $n$ linearly independent
functions modulo $\mathfrak{o}z$ in $\mathfrak{o}$ (§6), and hence the series of functions $\lambda_1, \lambda_2, \lambda_3, \ldots$
contains no more, and no less, than $n$ terms. We then have (§5)

$$\mathfrak{o} \equiv (\lambda_1, \lambda_2, \ldots, \lambda_n) \pmod{\mathfrak{o}z}.$$

The exponents $r_1, r_2, \ldots, r_n$ of the functions $\lambda_1, \lambda_2, \ldots, \lambda_n$ satisfy the conditions

$$r_1 = 0, \qquad 1 \le r_2 \le r_3 \le \cdots \le r_n.$$

Each function in $\mathfrak{o}$ with exponent $< r_s$ is congruent modulo $\mathfrak{o}z$ to a function in the
$(s - 1)$-tuple vector space

$$(\lambda_1, \lambda_2, \ldots, \lambda_{s-1}).$$
These functions $1, \lambda_2, \ldots, \lambda_n$ form a basis of $\mathfrak{o}$, as we see from the following
considerations.

If this were not the case, one could determine (§3, 7) a linear function $z - c$
and a system of constants $a_1, a_2, \ldots, a_n$, not all zero, such that

$$a_1\lambda_1 + a_2\lambda_2 + \cdots + a_n\lambda_n = (z - c)\omega.$$

If $a_s$ is the last nonvanishing constant $a$, then

$$a_1\lambda_1 + a_2\lambda_2 + \cdots + a_s\lambda_s = (z - c)\omega,$$

and the exponent of $\omega$ is certainly less than $r_s$ (since $\frac{(z-c)\omega}{z^{r_s}}$ is in $\mathfrak{o}'$). Thus $\omega$ is
congruent to a function of the vector space $(1, \lambda_2, \ldots, \lambda_{s-1})$ (mod $\mathfrak{o}z$), and then,
since $a_s$ is nonzero, so too is $\lambda_s$, contrary to hypothesis.

The functions $1, \lambda_2, \ldots, \lambda_n$ therefore form a basis of $\mathfrak{o}$, which will be called a
normal basis. The characteristic properties of normal bases are:

I. The functions $1, \lambda_2, \ldots, \lambda_n$ are linearly independent modulo $\mathfrak{o}z$.

II. Each function in $\mathfrak{o}$ whose exponent is smaller than the exponent $r_s$ of $\lambda_s$
belongs to the form

$$c_1\lambda_1 + c_2\lambda_2 + \cdots + c_{s-1}\lambda_{s-1} + z\omega_s,$$

where $c_1, c_2, \ldots, c_{s-1}$ are constants and $\omega_s$ is a function in $\mathfrak{o}$.


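3. The functions

$$\lambda_1' = \frac{\lambda_1}{z^{r_1}}, \quad \lambda_2' = \frac{\lambda_2}{z^{r_2}}, \quad \ldots, \quad \lambda_n' = \frac{\lambda_n}{z^{r_n}}$$

in $\mathfrak{o}'$ form a normal basis of $\mathfrak{o}'$.

If $\omega$ is a function in $\mathfrak{o}$ not divisible by $z$ and of exponent $r$, then the exponent of
$\omega' = \frac{\omega}{z^{r}}$ is likewise $r$ with respect to $z'$, because indeed $\frac{\omega'}{z'^{r}} = \omega$ is in $\mathfrak{o}$ but $\frac{\omega'}{z'^{r-1}} = \frac{\omega}{z}$
is not. Since the functions $1, \lambda_2, \ldots, \lambda_n$ are all not divisible by $z$, it follows that
the exponents of $\lambda_1', \lambda_2', \ldots, \lambda_n'$ with respect to $z'$ are $r_1, r_2, \ldots, r_n$ respectively.
Having established this, we now prove that the function system $\lambda_1', \lambda_2', \ldots, \lambda_n'$ has
the properties I, II when the $\mathfrak{o}, z$ therein are replaced by $\mathfrak{o}', z'$.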

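If condition I is not satisfied then constants $a_1, a_2, \ldots, a_s$ may be determined,
with the last one nonvanishing, so that

$$a_1\lambda_1' + a_2\lambda_2' + \cdots + a_s\lambda_s' = z'\omega',$$

and hence also (multiplying by $z^{r_s}$)

$$a_1 z^{r_s - r_1}\lambda_1 + a_2 z^{r_s - r_2}\lambda_2 + \cdots + a_s\lambda_s = \omega,$$

whence

$$\omega = z^{r_s - 1}\omega'$$

is a function in $\mathfrak{o}$ with exponent less than $r_s$. Since $a_s$ is nonzero, however, this is
impossible by the hypothesis on $\lambda_s$. Consequently, condition I is satisfied, whence
it follows that

$$\mathfrak{o}' \equiv (\lambda_1', \lambda_2', \ldots, \lambda_n') \pmod{\mathfrak{o}'z'}.$$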

If condition II is not satisfied, and $\lambda'$ is a function in $\mathfrak{o}'$ with exponent $r < r_s$,
and not belonging to the form

$$a_1\lambda_1' + a_2\lambda_2' + \cdots + a_{s-1}\lambda_{s-1}' + z'\omega',$$

then one can choose $e \ge s$ so that

$$\lambda' = a_1\lambda_1' + a_2\lambda_2' + \cdots + a_e\lambda_e' + z'\omega'$$

with constant coefficients, the last of which, $a_e$, does not vanish. Consequently, we
also have $r_e \ge r_s > r$.

This means that $\lambda = z^{r_e - 1}\lambda'$ is a function in $\mathfrak{o}$, and it follows by multiplication
by $z^{r_e}$ that

$$z\lambda = a_1 z^{r_e - r_1}\lambda_1 + a_2 z^{r_e - r_2}\lambda_2 + \cdots + a_e\lambda_e + z^{r_e - 1}\omega'.$$

Therefore $\omega = z^{r_e - 1}\omega'$ is a function in $\mathfrak{o}$ whose exponent (by 1) is $\le r_e - 1$, and
which satisfies the congruence

$$\omega \equiv a_1'\lambda_1 + a_2'\lambda_2 + \cdots + a_e'\lambda_e \pmod{\mathfrak{o}z},$$

where $a_e' = -a_e$ is nonzero. But then, by property II of the functions $\lambda$, we must
have the exponent of $\omega \ge r_e$, which is clearly a contradiction.

This completes the proof that the function system $\lambda_1', \lambda_2', \ldots, \lambda_n'$ is a normal
basis of $\mathfrak{o}'$.


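4. We now construct the discriminant of $\Omega$ with respect to the variables $z$ and
$z'$ with the help of the normal bases $\lambda$, $\lambda'$. We have

$$\Delta_z(\Omega) = \text{const.}\,\Delta(\lambda_1, \lambda_2, \ldots, \lambda_n),$$

$$\Delta_{z'}(\Omega) = \text{const.}\,\Delta(\lambda_1', \lambda_2', \ldots, \lambda_n').$$

But if one replaces $\lambda_s'$ by the expression $z'^{r_s}\lambda_s$, then it follows from the theorem §2,
(13) that

$$\Delta_{z'}(\Omega) = \text{const.}\,z'^{2(r_1 + r_2 + \cdots + r_n)}\,\Delta_z(\Omega).$$

If $\Delta_z(\Omega)$ is of degree $\delta$, then $\Delta_{z'}(\Omega)$ has the $(2(r_1 + r_2 + \cdots + r_n) - \delta)$-tuple root
$z' = 0$, and §16, 2 then gives the ramification number

$$w_z = 2(r_1 + r_2 + \cdots + r_n),$$

which is therefore always an even number.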
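
As a hedged illustration (ours, not part of the original text), the formula for the ramification number can be checked on the field defined by $y^2 = f(z)$, with $f$ squarefree of degree $d$. The sketch assumes that $1, y$ is a normal basis with exponents $r_1 = 0$ and $r_2 = \lceil d/2 \rceil$, which a short trace-and-norm computation suggests; the function name is hypothetical.

```python
def hyperelliptic_ramification(d):
    """Ramification number of z on y^2 = f(z), f squarefree of degree d >= 1.

    Assumption (ours, not from the text): 1, y is a normal basis with
    exponents r_1 = 0 and r_2 = ceil(d / 2).
    """
    exponents = [0, (d + 1) // 2]           # (d + 1) // 2 is ceil(d / 2)
    w_from_exponents = 2 * sum(exponents)   # w_z = 2 (r_1 + ... + r_n)
    # Classical count: d simple finite branch points, plus one more at
    # infinity exactly when d is odd.
    w_from_branch_points = d + (d % 2)
    return exponents, w_from_exponents, w_from_branch_points


for d in (3, 4, 5, 6):
    print(d, hyperelliptic_ramification(d))
```

In each case the two counts agree, and the common value is even, as subsection 4 requires.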


§23. The differential quotient 


SUMMARY AND COMMENTS 


Since the “Riemann surfaces” of Dedekind and Weber lack the 
structure needed to perform integration, theorems that were origi- 
nally about integrals (such as Abel’s theorem and the residue the- 
orem) are reformulated as theorems about differentials: the things 
that one integrates. Differentials can be defined algebraically, via 
the differential quotient. 

But likewise, as Weber (1908), §194, remarks: “The differential
quotient cannot be introduced in the normal way when we make
no use of continuity.” This is why the differential quotient $\frac{d\alpha}{d\beta}$ is
approached in a somewhat roundabout way for functions $\alpha, \beta \in \Omega$.
It turns out, however, that if $F(\alpha, \beta) = 0$ is the polynomial equation
satisfied by $\alpha$ and $\beta$ then

$$\frac{d\alpha}{d\beta} = -\frac{F'(\beta)}{F'(\alpha)},$$

as in traditional calculus, where Dedekind and Weber use $F'(\beta)$ to
denote $\partial F/\partial\beta$ and $F'(\alpha)$ to denote $\partial F/\partial\alpha$. Since $F$ is a polynomial,
$\partial F/\partial\beta$ and $\partial F/\partial\alpha$ are also polynomials. Moreover, they have
a simple algebraic meaning; for example $\partial F/\partial\alpha$ may be defined as
the coefficient of $\delta$ in the polynomial $F(\alpha + \delta, \beta)$.

Likewise, the Taylor series for $F(\alpha, \beta)$, deployed in subsection
2, is actually a finite sum of polynomials.
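
The purely algebraic reading of the partial derivative can be sketched as follows. This is an illustration of ours, not Dedekind and Weber's procedure: a polynomial is stored as a dictionary mapping exponent pairs $(i, j)$ to coefficients, and the helper names are hypothetical.

```python
from math import comb


def delta_coefficient(F, k=1):
    """Coefficient of delta**k in F(alpha + delta, beta).

    F is a dict {(i, j): c} standing for the sum of c * alpha**i * beta**j;
    the result is a dict of the same shape.
    """
    out = {}
    for (i, j), c in F.items():
        if i >= k:
            # (alpha + delta)**i contributes comb(i, k) * alpha**(i-k) * delta**k
            key = (i - k, j)
            out[key] = out.get(key, 0) + comb(i, k) * c
    return {key: c for key, c in out.items() if c != 0}


def formal_partial_alpha(F):
    """Term-by-term formal derivative of F with respect to alpha."""
    return {(i - 1, j): i * c for (i, j), c in F.items() if i >= 1 and i * c != 0}


# Example polynomial (our choice): F = alpha^3 + alpha*beta^2 - beta
F = {(3, 0): 1, (1, 2): 1, (0, 1): -1}
print(delta_coefficient(F))        # coefficient of delta
print(formal_partial_alpha(F))     # the same polynomial
print(delta_coefficient(F, 4))     # empty: the expansion in delta is finite
```

The last line reflects the remark above: because $F$ has degree 3 in $\alpha$, the expansion in powers of $\delta$ stops after the cubic term.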

We also have the chain rule

$$\frac{d\alpha}{d\beta} = \frac{d\alpha}{d\gamma}\,\frac{d\gamma}{d\beta},$$

and this makes it possible to associate a differential $d\alpha$ with each
function $\alpha$ so that $\frac{d\alpha}{d\beta}$ is the quotient of the differentials $d\alpha$, $d\beta$.
Moreover, we have the standard rules for the differential of the sum,
difference, product, and quotient.

If $\omega$ is an integral algebraic function, then $\frac{d\omega}{dz}$ is not necessarily
an integral algebraic function. However, it is shown in subsection 5
that $\frac{d\omega}{dz}$ belongs to the complementary module $\mathfrak{e}$ of the module $\mathfrak{o}$
of integral algebraic functions, introduced in §10. The ideal $\mathfrak{f}$ from
§10 also returns in this section. In subsection 6 it turns out to be
the “ideal of double points.”


1. Since each nonzero function in the field $\Omega$ takes the value zero at only a
finite number of points, it follows that a function in $\Omega$ with infinitely many zeros is
necessarily identically zero, and that two functions in $\Omega$ agreeing at infinitely many
points are identical.


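2. If $\alpha, \beta$ are any two variables in the field $\Omega$ then there is a function in $\Omega$,
denoted by $\left(\frac{d\alpha}{d\beta}\right)$, which at infinitely many points $\mathfrak{P}$ satisfies the condition

$$\left(\frac{d\alpha}{d\beta}\right)_0 = \left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0,$$

and is called the differential quotient of $\alpha$ with respect to $\beta$. Namely, if $F(\alpha, \beta) = 0$
is the irreducible equation relating $\alpha, \beta$ then, when we exclude the (finitely many)
points at which $\alpha_0$ or $\beta_0 = \infty$ or $F'(\alpha_0) = 0$ or $F'(\beta_0) = 0$, we obtain

$$\begin{aligned}
0 = F(\alpha, \beta) ={}& F(\alpha_0, \beta_0) + (\alpha - \alpha_0)F'(\alpha_0) + (\beta - \beta_0)F'(\beta_0) \\
&+ \tfrac{1}{2}\left\{(\alpha - \alpha_0)^2 F''(\alpha_0, \alpha_0) + 2(\alpha - \alpha_0)(\beta - \beta_0)F''(\alpha_0, \beta_0) + (\beta - \beta_0)^2 F''(\beta_0, \beta_0)\right\} \\
&+ \cdots
\end{aligned}$$

Since $F(\alpha_0, \beta_0) = 0$, the equation above yields the following:

One of the two quotients $\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0$, $\left(\frac{\beta - \beta_0}{\alpha - \alpha_0}\right)_0$ is certainly finite. If it is the first, then

$$\begin{aligned}
0 ={}& \frac{\alpha - \alpha_0}{\beta - \beta_0}F'(\alpha_0) + F'(\beta_0) \\
&+ (\beta - \beta_0)\,\tfrac{1}{2}\left\{\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)^2 F''(\alpha_0, \alpha_0) + 2\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)F''(\alpha_0, \beta_0) + F''(\beta_0, \beta_0)\right\} + \cdots,
\end{aligned}$$

whence it follows that at the points $\mathfrak{P}$:

$$\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0 = -\frac{F'(\beta_0)}{F'(\alpha_0)} = \left(-\frac{F'(\beta)}{F'(\alpha)}\right)_0.$$

Thus

$$\left(\frac{d\alpha}{d\beta}\right) = -\frac{F'(\beta)}{F'(\alpha)} \tag{1}$$

has the desired property. This remains true when one of the functions $\alpha, \beta$ is a
constant. Because if, e.g., $\alpha$ is constant, then $F(\alpha, \beta) = \alpha - \alpha_0$ is independent of
$\beta$, so $F'(\alpha) = 1$, $F'(\beta) = 0$.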
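
Formula (1) can be tried out on a concrete curve. The following is a minimal sketch of ours, assuming the circle $\alpha^2 + \beta^2 - 1 = 0$ as the equation $F = 0$ and using its standard rational parametrization to produce exact points; all function names are hypothetical. The comparison with a nearby chord is only a numerical sanity check, since the text defines the differential quotient without any appeal to limits.

```python
from fractions import Fraction


def partials(F):
    """Formal partial derivatives (F'(alpha), F'(beta)) of a dict polynomial."""
    Fa = {(i - 1, j): i * c for (i, j), c in F.items() if i >= 1}
    Fb = {(i, j - 1): j * c for (i, j), c in F.items() if j >= 1}
    return Fa, Fb


def evaluate(P, a, b):
    """Value of the dict polynomial P at alpha = a, beta = b."""
    return sum(c * a**i * b**j for (i, j), c in P.items())


def differential_quotient(F, a0, b0):
    """Value of -F'(beta)/F'(alpha) at a point (a0, b0) where F'(alpha) != 0."""
    Fa, Fb = partials(F)
    return -evaluate(Fb, a0, b0) / evaluate(Fa, a0, b0)


def circle_point(t):
    """Exact rational point on alpha^2 + beta^2 = 1 for a rational parameter t."""
    t = Fraction(t)
    return (1 - t * t) / (1 + t * t), 2 * t / (1 + t * t)


# Illustrative curve (our choice): F = alpha^2 + beta^2 - 1
F = {(2, 0): 1, (0, 2): 1, (0, 0): -1}

a0, b0 = circle_point(Fraction(1, 2))                       # the point (3/5, 4/5)
a1, b1 = circle_point(Fraction(1, 2) + Fraction(1, 10**6))  # a nearby point
print(differential_quotient(F, a0, b0))                     # -4/3
print(float((a1 - a0) / (b1 - b0)))                         # approximately the same
```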


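3. It follows from the foregoing that, when $\beta$ is not constant, $\left(\frac{d\alpha}{d\beta}\right)$ has a
finite value except at a finite number of points. Therefore, if $\gamma$ is a third variable
in $\Omega$, then

$$\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0 = \left(\frac{\alpha - \alpha_0}{\gamma - \gamma_0}\right)_0 \left(\frac{\gamma - \gamma_0}{\beta - \beta_0}\right)_0$$

at infinitely many points, so we also have

$$\left(\frac{d\alpha}{d\beta}\right)_0 = \left(\frac{d\alpha}{d\gamma}\right)_0 \left(\frac{d\gamma}{d\beta}\right)_0.$$

From this and 1 we get the identity

$$\left(\frac{d\alpha}{d\beta}\right) = \left(\frac{d\alpha}{d\gamma}\right)\left(\frac{d\gamma}{d\beta}\right). \tag{2}$$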


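4. As a consequence of the latter theorem we can associate with each function
$\alpha, \beta, \gamma, \ldots$ in the field $\Omega$ a function $d\alpha, d\beta, d\gamma, \ldots$ (differential) in such a way that
in general

$$\frac{d\alpha}{d\beta} = \left(\frac{d\alpha}{d\beta}\right).^{47}$$

47 One can also define the differential quotient by the equation

$$\left(\frac{d\alpha}{d\beta}\right) = -\frac{F'(\beta)}{F'(\alpha)}$$

and prove the theorem

$$\left(\frac{d\alpha}{d\beta}\right) = \left(\frac{d\alpha}{d\gamma}\right)\left(\frac{d\gamma}{d\beta}\right)$$

by algebraic division.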
The differentials of constants, and only these, are set equal to zero. The rest are
completely determined as soon as one of them is taken arbitrarily. If the variables
$\alpha, \beta, \gamma, \ldots$ satisfy a rational equation

$$F(\alpha, \beta, \gamma, \ldots) = 0,$$

then it follows that

$$F'(\alpha)\,d\alpha + F'(\beta)\,d\beta + F'(\gamma)\,d\gamma + \cdots = 0. \tag{3}$$

Because one can conclude, as with (2), that this equation holds at infinitely many
points.

Immediate consequences of the last theorem are the well-known rules for
differentiation of sums, differences, products and quotients:

$$d(\alpha \pm \beta) = d\alpha \pm d\beta, \tag{4}$$

$$d(\alpha\beta) = \alpha\,d\beta + \beta\,d\alpha, \tag{5}$$

$$d\left(\frac{\alpha}{\beta}\right) = \frac{\beta\,d\alpha - \alpha\,d\beta}{\beta^2}. \tag{6}$$
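5. If $\omega$ is an integral algebraic function of $z$ then $\frac{d\omega}{dz}$ is not in general an integral
algebraic function of $z$. However, it is clear from the expression (§3, 7)

$$\omega = x_1\omega_1 + x_2\omega_2 + \cdots + x_n\omega_n,$$

since the differential quotients of the polynomial functions $x_1, x_2, \ldots, x_n$ are again
polynomials, that the lower ideals of all the functions $\frac{d\omega}{dz}$ must divide a particular
ideal, namely the least common multiple of the lower ideals of $\frac{d\omega_1}{dz}, \frac{d\omega_2}{dz}, \ldots, \frac{d\omega_n}{dz}$.
We wish to find out what this ideal is. To that end let $z - c$ be an arbitrary linear
function of $z$ and let

$$\mathfrak{o}(z - c) = \mathfrak{p}^{e}\,\mathfrak{p}_1^{e_1}\,\mathfrak{p}_2^{e_2}\cdots,$$

where the prime ideals $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$ are distinct. Now let $\zeta$ be the same function as
in §11, 2, i.e., an integral algebraic function of $z$ with distinct values at the points
$\mathfrak{P}, \mathfrak{P}_1, \mathfrak{P}_2, \ldots$ generated by the prime ideals $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$, and which takes each of
these values simply. Then $\omega$ may be expressed in the form

$$\omega = y_0 + y_1\zeta + \cdots + y_{n-1}\zeta^{n-1},$$

where the rational functions $y_0, y_1, \ldots, y_{n-1}$ of $z$ may be fractional, but the factor
$z - c$ certainly does not occur in a denominator. It follows that the lower ideal of
$\frac{d\omega}{dz}$ cannot be divisible by higher powers of the ideals $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$ than the lower
ideal of $\frac{d\zeta}{dz}$. But if the irreducible equation between $\zeta$ and $z$ is

$$f(\zeta, z) = 0$$

then, by §11, 2,

$$\mathfrak{o}f'(\zeta) = \mathfrak{m}\,\mathfrak{p}^{e-1}\,\mathfrak{p}_1^{e_1-1}\,\mathfrak{p}_2^{e_2-1}\cdots$$

and $\mathfrak{m}$ is relatively prime to $\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$. However, since

$$\frac{d\zeta}{dz} = -\frac{f'(z)}{f'(\zeta)},$$

the lower ideal of $\frac{d\zeta}{dz}$, and hence also that of $\frac{d\omega}{dz}$, can contain none of the factors
$\mathfrak{p}, \mathfrak{p}_1, \mathfrak{p}_2, \ldots$ more often than $(e - 1), (e_1 - 1), (e_2 - 1), \ldots$ times respectively. Now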
since $z - c$ can be an arbitrary linear function, it follows that $\frac{d\omega}{dz}$ can have no lower
ideal but one that divides the ramification ideal $\mathfrak{z} = \prod \mathfrak{p}^{e-1}$ (§11). Thus we have

$$\mathfrak{z}\,\frac{d\omega}{dz} = \mathfrak{a},$$

where $\mathfrak{a}$ denotes an ideal, and hence by §11, (7)

$$\frac{d\omega}{dz} = \mathfrak{a}\mathfrak{e},$$

whence it follows that the functions $\frac{d\omega}{dz}$ all belong to the complementary module $\mathfrak{e}$
of $\mathfrak{o}$.


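6. If the irreducible equation $F(\omega, z) = 0$ between $\omega$ and $z$ is of $n$th degree
with respect to $\omega$, so that $1, \omega, \omega^2, \ldots, \omega^{n-1}$ is a basis of $\Omega$, then by §11, (10)

$$\mathfrak{o}F'(\omega) = \mathfrak{z}\mathfrak{f}.$$

Therefore, since

$$\frac{d\omega}{dz} = -\frac{F'(z)}{F'(\omega)},$$

$\mathfrak{o}F'(z)$ must be divisible by the ideal $\mathfrak{f}$:

$$\mathfrak{o}F'(z) = \mathfrak{f}\mathfrak{a}.$$

One may therefore call $\mathfrak{f}$ the ideal of the double points with respect to $\omega, z$.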


7. If $\mathfrak{P}$ is a point at which $z - c$ is infinitely small of first order (and hence not
a ramification point in $z$), then the functions $\frac{d\omega}{dz}$ are all finite at $\mathfrak{P}$, by 5. Thus if $\eta$
is any function in $\Omega$ that is finite at $\mathfrak{P}$ then one can represent it as the quotient $\frac{\alpha}{\beta}$
of two integral algebraic functions, of which $\beta$ does not vanish at $\mathfrak{P}$, so that $\frac{d\eta}{dz}$ is
also finite at $\mathfrak{P}$ by (6).


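8. Now let $\alpha, \beta$ be any two variables in $\Omega$. We shall investigate the behaviour
of $\frac{d\alpha}{d\beta}$ at any point $\mathfrak{P}$.

One chooses a variable $z$ in $\Omega$ that is infinitely small of first order at $\mathfrak{P}$. If
$\alpha$ has a finite value $\alpha_0$ at $\mathfrak{P}$, then by §15, 1, and 2 one can determine a positive
integer $r$ and a function $\alpha'$ finite and nonzero at $\mathfrak{P}$ so that

$$\alpha = \alpha_0 + z^{r}\alpha'.$$

This still holds when $\alpha$ is infinite at $\mathfrak{P}$, except that $r$ is then a negative integer and
$\alpha_0$ has to be replaced by an arbitrary finite constant, e.g., 0. Similarly one can set

$$\beta = \beta_0 + z^{s}\beta'.$$

Then $r$ and $s$ are the order numbers of $\alpha - \alpha_0$, $\beta - \beta_0$ at the point $\mathfrak{P}$, which may
be positive or negative, but not zero. It then follows from (2) that

$$\frac{d\alpha}{d\beta} = z^{r-s}\,\frac{r\alpha' + z\,\dfrac{d\alpha'}{dz}}{s\beta' + z\,\dfrac{d\beta'}{dz}}$$

or

$$\frac{\beta - \beta_0}{\alpha - \alpha_0}\,\frac{d\alpha}{d\beta} = \frac{\beta'\left(r\alpha' + z\,\dfrac{d\alpha'}{dz}\right)}{\alpha'\left(s\beta' + z\,\dfrac{d\beta'}{dz}\right)}.$$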
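Now if we again use the subscript 0 to denote the value of a function at the point
$\mathfrak{P}$ then, since

$$\left(\frac{d\alpha'}{dz}\right)_0, \quad \left(\frac{d\beta'}{dz}\right)_0$$

are finite by 7,

$$\left(\frac{\beta - \beta_0}{\alpha - \alpha_0}\,\frac{d\alpha}{d\beta}\right)_0 = \frac{r}{s} \tag{7}$$

is therefore finite and nonzero. It follows that the order number of the differential
quotient $\frac{d\alpha}{d\beta}$ is equal to the difference of the order numbers of $\alpha - \alpha_0$ and $\beta - \beta_0$.

If $r \ne s$ then $\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0$ and hence $\left(\frac{d\alpha}{d\beta}\right)_0$ is zero or infinite. On the other hand, if
$r = s$ then both values are finite and nonzero, and hence we have, in all cases,

$$\left(\frac{\alpha - \alpha_0}{\beta - \beta_0}\right)_0 = \left(\frac{d\alpha}{d\beta}\right)_0. \tag{8}$$

Here, $\alpha_0$ and $\beta_0$ are the values of $\alpha$ and $\beta$ at $\mathfrak{P}$, when these values are finite;
otherwise they are arbitrary constants, e.g., 0.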
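
The order count in this subsection can be sketched with finite Laurent polynomials in the local variable $z$. This is an illustration of ours under the assumption $\alpha_0 = \beta_0 = 0$; the two expansions and the helper names are hypothetical, and a series is stored as a dictionary from exponents to coefficients.

```python
from fractions import Fraction


def order(series):
    """Order number at z = 0: the least exponent with a nonzero coefficient."""
    return min(e for e, c in series.items() if c != 0)


def leading(series):
    """Coefficient of the lowest-order term."""
    return series[order(series)]


def derivative(series):
    """Termwise derivative d/dz of a finite Laurent polynomial."""
    return {e - 1: e * c for e, c in series.items() if e * c != 0}


# Illustrative local expansions (ours), with alpha_0 = beta_0 = 0:
alpha = {3: Fraction(1), 4: Fraction(2)}      # z^3 (1 + 2z),   r = 3
beta = {-2: Fraction(1), -1: Fraction(5)}     # z^-2 (1 + 5z),  s = -2

r, s = order(alpha), order(beta)
# Order of d(alpha)/d(beta) = order of d(alpha)/dz minus order of d(beta)/dz.
order_of_quotient = order(derivative(alpha)) - order(derivative(beta))
# Value at the point of (beta/alpha) * d(alpha)/d(beta), as in formula (7).
value_7 = (leading(derivative(alpha)) / leading(derivative(beta))) * (
    leading(beta) / leading(alpha)
)
print(r, s, order_of_quotient)   # 3 -2 5
print(value_7)                   # -3/2
```

The order of the quotient is $r - s = 5$, and the value in formula (7) comes out as $r/s = -3/2$, in line with the statements above.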


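9. If $a, b$ are the order numbers of $\alpha - \alpha_0$, $\beta - \beta_0$ at $\mathfrak{P}$ then, when $a, b$ are positive,
the point $\mathfrak{P}$ occurs $(a - 1)$-tuply, respectively, $(b - 1)$-tuply, in the ramification
polygons $\mathfrak{Z}_\alpha, \mathfrak{Z}_\beta$ in $\alpha, \beta$. But, if $a$ is negative, $\mathfrak{Z}_\alpha$ contains the point $\mathfrak{P}$ $(-a - 1)$-tuply,
and similarly when $b$ is negative (§16, 1). Thus if $\mathfrak{A}, \mathfrak{B}$ denote the lower
polygons of $\alpha, \beta$ one obtains the following expression for $\frac{d\alpha}{d\beta}$ as a polygon quotient:

$$\frac{d\alpha}{d\beta} = \frac{\mathfrak{Z}_\alpha \mathfrak{B}^2}{\mathfrak{Z}_\beta \mathfrak{A}^2}, \tag{9}$$

since the order number of $\frac{d\alpha}{d\beta}$ (as proved above) is always $a - b$.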


§24. The genus of the field 2 


SUMMARY AND COMMENTS 


It follows from the Riemann-Hurwitz formula, discussed in Section 4
of the Translator’s Introduction, that the genus $p$ satisfies

$$p = \tfrac{1}{2}w - n + 1,$$

where $w$ is the sum of the ramification numbers and $n$ is the degree
or “sheet number.” As we said there, this motivates an algebraic
definition of genus, since $n$ and $w$ are so definable.

Now is the time to introduce this definition, because the results
of the previous section enable Dedekind and Weber to prove that
$w_\alpha - 2n_\alpha$ is independent of the choice of function $\alpha$, and hence
that $p$ is an invariant of the field $\Omega$.

The remainder of the section investigates the relation between
the genus $p$ of $\Omega$ and the orders $m$ and $n$ of a pair of generating
functions of $\Omega$, ending with the formula

$$p = (n - 1)(m - 1) - r,$$

where $r$ is a number that counts “double points.”
1. If one lets $w_\alpha, w_\beta$ denote the ramification numbers, and $n_\alpha, n_\beta$ the orders,
of the variables $\alpha, \beta$, then it follows from formula (9) of the preceding section that,
since numerator and denominator of $\frac{d\alpha}{d\beta}$ must have equal numbers of points, we
have the important relation

$$w_\alpha - 2n_\alpha = w_\beta - 2n_\beta.$$

Thus when one sets

$$p = \tfrac{1}{2}w - n + 1, \tag{1}$$

which is an integer by §22, 4, then this number is independent of the choice of
variables. It is a characteristic number of the field $\Omega$, called the genus of $\Omega$. One
sees that this number is never negative by replacing $\tfrac{1}{2}w$ by the value $r_1 + r_2 + \cdots + r_n$
from §22. Then one obtains

$$p = (r_2 - 1) + (r_3 - 1) + \cdots + (r_n - 1), \tag{2}$$

which cannot be negative because $r_2, r_3, \ldots, r_n \ge 1$.
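
The passage from (1) to (2) is plain arithmetic once $r_1 = 0$ is used, and can be sketched as below. The exponent lists are hypothetical inputs chosen by us for illustration, not data from the text, and the function names are ours.

```python
def genus_from_definition(exponents):
    """p = w/2 - n + 1, with w = 2 (r_1 + ... + r_n) as in §22, 4."""
    n = len(exponents)
    w = 2 * sum(exponents)
    return w // 2 - n + 1


def genus_from_formula_2(exponents):
    """p = (r_2 - 1) + ... + (r_n - 1); relies on r_1 = 0."""
    assert exponents[0] == 0, "a normal basis starts with a constant, so r_1 = 0"
    return sum(r - 1 for r in exponents[1:])


# Hypothetical exponent lists r_1 <= r_2 <= ... <= r_n with r_1 = 0:
for exps in ([0], [0, 1], [0, 2], [0, 3], [0, 1, 2], [0, 2, 2, 3]):
    print(exps, genus_from_definition(exps), genus_from_formula_2(exps))
```

Both functions return the same value on every line, and that value is never negative because each $r_s$ with $s \ge 2$ is at least 1.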


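2. Let $\alpha, \beta$ be two functions in $\Omega$ with orders $m, n$ and the property that all
functions in $\Omega$ are rationally expressible in terms of $\alpha, \beta$. Then let

$$\begin{aligned}
F(\alpha, \beta) &= a_0\alpha^{n} + a_1\alpha^{n-1} + \cdots + a_{n-1}\alpha + a_n \\
&= b_0\beta^{m} + b_1\beta^{m-1} + \cdots + b_{m-1}\beta + b_m = 0
\end{aligned}$$

be the irreducible equation between $\alpha, \beta$, where $a_0, a_1, \ldots, a_n$ are polynomial
functions of $\beta$ and $b_0, b_1, \ldots, b_m$ are likewise polynomial functions of $\alpha$.
Also let

$$\alpha = \frac{\mathfrak{A}_1}{\mathfrak{A}}, \quad \beta = \frac{\mathfrak{B}_1}{\mathfrak{B}},$$

where $\mathfrak{A}$ is relatively prime to $\mathfrak{A}_1$ and $\mathfrak{B}$ is relatively prime to $\mathfrak{B}_1$, so that $\mathfrak{A}, \mathfrak{A}_1$
are of order $m$, and $\mathfrak{B}, \mathfrak{B}_1$ are of order $n$. Now

$$F'(\alpha) = n a_0\alpha^{n-1} + (n - 1)a_1\alpha^{n-2} + \cdots + a_{n-1},$$

$$\alpha F'(\alpha) = -a_1\alpha^{n-1} - 2a_2\alpha^{n-2} - \cdots - n a_n,$$

whence it follows that

$$F'(\alpha) = \frac{\mathfrak{K}}{\mathfrak{A}^{n-2}\mathfrak{B}^{m}}$$

and similarly

$$F'(\beta) = \frac{\mathfrak{L}}{\mathfrak{A}^{n}\mathfrak{B}^{m-2}}.$$

We shall now prove that the polygon $\mathfrak{K}$ is divisible by $\mathfrak{Z}_\beta$, and $\mathfrak{L}$ by $\mathfrak{Z}_\alpha$.
For $\mathfrak{K}$ this is easy to see under the hypothesis that at all points of $\mathfrak{Z}_\beta$ the
function $\beta$ has a finite value and $a_0$ has a nonzero value. Because

$$\alpha' = a_0\alpha$$

is an integral algebraic function of $\beta$, and when one sets

$$f(\alpha') = a_0^{n-1}F(\alpha, \beta)$$

one has

$$f'(\alpha') = a_0^{n-2}F'(\alpha).$$

Now since $\mathfrak{o}_\beta f'(\alpha')$ is divisible by the ramification ideal in $\beta$ generated by $\mathfrak{Z}_\beta$, by
§11, 5, the correctness of the assertion follows. An analogous result holds for $F'(\beta)$.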
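If one now subjects $\alpha, \beta$ to arbitrary linear transformations:

$$\alpha = \frac{c + d\alpha'}{a + b\alpha'}, \quad \beta = \frac{c' + d'\beta'}{a' + b'\beta'},$$

$$(a + b\alpha')(d - b\alpha) = ad - bc,$$

$$(a' + b'\beta')(d' - b'\beta) = a'd' - b'c',$$

then

$$\mathfrak{Z}_\alpha = \mathfrak{Z}_{\alpha'}, \quad \mathfrak{Z}_\beta = \mathfrak{Z}_{\beta'}$$

by §16, 2, and the irreducible equation between $\alpha', \beta'$ reads:

$$F_1(\alpha', \beta') = (a + b\alpha')^{n}(a' + b'\beta')^{m}F(\alpha, \beta) = 0.$$

But it is possible under all circumstances to choose the constants $a, b, c, d$; $a', b', c', d'$
so that the above hypotheses are satisfied for both $\alpha'$ and $\beta'$.

If one puts the coefficients $a_0', b_0'$ of $\alpha'^{n}, \beta'^{m}$ in $F_1(\alpha', \beta')$ in the form

$$\begin{aligned}
a_0' &= (a' + b'\beta')^{m}\,(a_0 d^{n} + a_1 d^{n-1}b + \cdots + a_n b^{n}) \\
&= \left(\frac{a'd' - b'c'}{d' - b'\beta}\right)^{m}(a_0 d^{n} + a_1 d^{n-1}b + \cdots + a_n b^{n}),
\end{aligned}$$

$$\begin{aligned}
b_0' &= (a + b\alpha')^{n}\,(b_0 d'^{m} + b_1 d'^{m-1}b' + \cdots + b_m b'^{m}) \\
&= \left(\frac{ad - bc}{d - b\alpha}\right)^{n}(b_0 d'^{m} + b_1 d'^{m-1}b' + \cdots + b_m b'^{m}),
\end{aligned}$$

then one easily sees that the functions $a_0'$, $d' - b'\beta$ can vanish at a point of $\mathfrak{Z}_\beta$, and
$b_0'$, $d - b\alpha$ can vanish at a point of $\mathfrak{Z}_\alpha$, for only a finite number of values of the ratios
$d : b$, $d' : b'$.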

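If we now set

$$\alpha' = \frac{\mathfrak{A}_1'}{\mathfrak{A}'}, \quad \beta' = \frac{\mathfrak{B}_1'}{\mathfrak{B}'},$$

then it follows (§19, 1) that

$$d - b\alpha = \frac{\mathfrak{A}_2}{\mathfrak{A}}, \quad a + b\alpha' = \frac{\mathfrak{A}_2'}{\mathfrak{A}'},$$

so:

$$\mathfrak{A}_2\mathfrak{A}_2' = \mathfrak{A}\mathfrak{A}'.$$

However, if, as assumed, $b$ is nonzero, then $\mathfrak{A}_2$ is relatively prime to $\mathfrak{A}$, because at
a point of $\mathfrak{A}$ the order number of $d - b\alpha$ is the same as that of $\alpha$ (§15, 5) and hence

$$\mathfrak{A}_2 = \mathfrak{A}', \quad \mathfrak{A}_2' = \mathfrak{A}.$$

So:

$$a + b\alpha' = \frac{\mathfrak{A}}{\mathfrak{A}'}$$

and similarly

$$a' + b'\beta' = \frac{\mathfrak{B}}{\mathfrak{B}'}.$$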

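But now, since $F(\alpha, \beta) = 0$, we have

$$F_1'(\alpha') = (ad - bc)(a + b\alpha')^{n-2}(a' + b'\beta')^{m}F'(\alpha),$$

and thus when

$$F_1'(\alpha') = \frac{\mathfrak{R}\mathfrak{Z}_\beta}{\mathfrak{A}'^{\,n-2}\mathfrak{B}'^{\,m}},$$

as assumed, it follows that

$$F'(\alpha) = \frac{\mathfrak{R}\mathfrak{Z}_\beta}{\mathfrak{A}^{n-2}\mathfrak{B}^{m}}$$

and similarly

$$F'(\beta) = \frac{\mathfrak{R}\mathfrak{Z}_\alpha}{\mathfrak{A}^{n}\mathfrak{B}^{m-2}}.$$

The same polygon necessarily occurs in the numerators of both expressions
because

$$\frac{d\alpha}{d\beta} = -\frac{F'(\beta)}{F'(\alpha)} = \frac{\mathfrak{B}^2\mathfrak{Z}_\alpha}{\mathfrak{A}^2\mathfrak{Z}_\beta}.$$

Now the order of the polygon $\mathfrak{A}^{n-2}\mathfrak{B}^{m}$ is

$$m(n - 2) + mn = 2m(n - 1),$$

thus the order of $\mathfrak{R}$,

$$2r = 2m(n - 1) - w_\beta,$$

is always even, and this implies

$$p = \tfrac{1}{2}w_\beta - n + 1 = (n - 1)(m - 1) - r. \tag{3}$$

The polygon $\mathfrak{R}$ is called the polygon of the double points in $(\alpha, \beta)$.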
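
Formula (3) can be checked numerically in a simple family. The sketch below is ours and rests on stated assumptions: the equation is $\alpha^2 = f(\beta)$ with $f$ squarefree of degree $d$, so that $\beta$ has order $n = 2$, $\alpha$ has order $m = d$, and the ramification number of $\beta$ is taken to be $2\lceil d/2 \rceil$ (the classical branch-point count). The function names are hypothetical.

```python
def double_point_number(m, n, w_beta):
    """r from 2r = 2m(n - 1) - w_beta; w_beta is even by §22, 4."""
    two_r = 2 * m * (n - 1) - w_beta
    assert two_r % 2 == 0
    return two_r // 2


def genus_two_ways(m, n, w_beta):
    """Return (w_beta/2 - n + 1, (n - 1)(m - 1) - r), the two sides of (3)."""
    r = double_point_number(m, n, w_beta)
    return w_beta // 2 - n + 1, (n - 1) * (m - 1) - r


# Illustrative case (ours): alpha^2 = f(beta), f squarefree of degree d.
for d in (3, 4, 5, 6):
    w_beta = 2 * ((d + 1) // 2)          # assumed ramification number of beta
    print(d, double_point_number(d, 2, w_beta), genus_two_ways(d, 2, w_beta))
```

For $d = 3$ this gives $r = 1$ and genus 1 on both sides of (3).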


§25. The differentials in 2 


SUMMARY AND COMMENTS 


In this section Dedekind and Weber extend the concept of differential
from the differentials $d\alpha$ (for $\alpha \in \Omega$), introduced in §23, to
more general differentials of the form $\omega\,dz$. The latter are defined so
as to be independent of the variable $z$ and are called differentials in
$\Omega$. The differentials $d\alpha$ from §23 are now called proper differentials,
while the new ones $\omega\,dz$ (if not of the form $d\alpha$) are called improper
or Abelian differentials.


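If $z, z_1$ are any two variables in $\Omega$, with orders $n, n_1$ and ramification numbers
$w, w_1$, and if $\mathfrak{Z}, \mathfrak{Z}_1$ are the ramification polygons and $\mathfrak{U}, \mathfrak{U}_1$ are the lower polygons⁴⁸
of $z, z_1$ then (§23)

$$\frac{dz}{dz_1} = \frac{\mathfrak{Z}\,\mathfrak{U}_1^2}{\mathfrak{Z}_1\mathfrak{U}^2}. \tag{1}$$

Each function $\omega$ in $\Omega$ may be put in the form

$$\omega = \frac{\mathfrak{U}^2\mathfrak{A}}{\mathfrak{Z}\,\mathfrak{B}}, \tag{2}$$

where $\mathfrak{A}, \mathfrak{B}$ are polygons whose orders $a, b$ satisfy the condition

$$2n + a = w + b$$

or (§24)

$$a = b + 2p - 2. \tag{3}$$

48 The letter $\mathfrak{U}$ (Fraktur U) stands for the German word “unter,” meaning lower or under.
(Translator’s note.)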
Now when one defines a function $\omega_1$ by the equation

$$\omega\,dz = \omega_1\,dz_1,$$

then $\omega_1$ acquires the description

$$\omega_1 = \frac{\mathfrak{U}_1^2\mathfrak{A}}{\mathfrak{Z}_1\mathfrak{B}}$$

from (1). In what follows we shall call expressions such as

$$\omega\,dz = \omega_1\,dz_1$$

differentials in $\Omega$, and denote them by symbols such as $d\tilde\omega$. Such a differential is
thereby defined invariantly, i.e., independently of the choice of variable $z$, and is
completely determined by the two polygons $\mathfrak{A}, \mathfrak{B}$.

Without danger of misunderstanding we can use the symbolic notation

$$d\tilde\omega = \frac{\mathfrak{A}}{\mathfrak{B}}$$

and hence also, for example,

$$dz = \frac{\mathfrak{Z}}{\mathfrak{U}^2}.$$

This notation for a differential as a quotient of polygons is distinguished from
the similar notation for functions in $\Omega$ (§17) by the fact that in the latter the
numerator and denominator are of the same order, while in differentials the order
of the numerator exceeds that of the denominator by $2p - 2$. As with the notation
of §17, any common divisor of $\mathfrak{A}$ and $\mathfrak{B}$ can be cancelled out. If $\mathfrak{A}$ and $\mathfrak{B}$ are
relatively prime, then $\mathfrak{A}$ is called the upper polygon, and $\mathfrak{B}$ the lower polygon, of
the differential $d\tilde\omega$.

The general concept of differentials in $\Omega$ presented here includes the differentials
of functions in the field $\Omega$, defined in §23, 4. We call the latter proper differentials,
while the others, which cannot be represented as differentials of functions existing
in $\Omega$, will be called improper or Abelian differentials.

Functions of the form (2), which in our newly adopted notation may be denoted
by $\frac{d\tilde\omega}{dz}$, will be called differential quotients with respect to $z$, and we similarly
distinguish between proper and improper differential quotients, according as $d\tilde\omega$ is
a proper or improper differential.⁴⁹
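
As a small worked example of ours (not in the original), take the simplest case $\Omega = \mathbb{C}(z)$, where $n = 1$, $w = 0$ and hence $p = 0$. We write $\mathfrak{P}_\infty$ and $\mathfrak{P}_0$ for the points where $z = \infty$ and $z = 0$, and $\mathfrak{O}$ for the null-gon; these labels are our own. The symbolic notation then reads as follows, and the order rule $a = b + 2p - 2$ can be seen directly.

```latex
% Worked example (assumption: \Omega = \mathbb{C}(z), so n = 1, w = 0, p = 0).
% The ramification polygon of z is the null-gon and its lower polygon is \mathfrak{P}_\infty.
\[
  dz = \frac{\mathfrak{Z}}{\mathfrak{U}^{2}} = \frac{\mathfrak{O}}{\mathfrak{P}_\infty^{2}},
  \qquad a - b = 0 - 2 = 2p - 2 .
\]
% For z_1 = 1/z the lower polygon is \mathfrak{P}_0, and formula (1) gives
\[
  \frac{dz}{dz_1}
  = \frac{\mathfrak{Z}\,\mathfrak{U}_1^{2}}{\mathfrak{Z}_1\,\mathfrak{U}^{2}}
  = \frac{\mathfrak{P}_0^{2}}{\mathfrak{P}_\infty^{2}},
\]
% in agreement with dz/dz_1 = -z^2, which has a double zero at z = 0
% and a double pole at z = \infty.
```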

There now arises the problem of determining the extent of the differential
concept, i.e., of finding all polygons $\mathfrak{A}, \mathfrak{B}$ that can be upper and lower polygons of a
differential. We begin with the following general remarks:

A necessary and sufficient condition for $\frac{\mathfrak{A}}{\mathfrak{B}}$ to be a differential is that

$$\frac{\mathfrak{U}^2\mathfrak{A}}{\mathfrak{Z}\,\mathfrak{B}}$$

be a function in $\Omega$ for an arbitrary variable $z$, in other words, that $\mathfrak{U}^2\mathfrak{A}$ be equivalent
to $\mathfrak{Z}\mathfrak{B}$. This relationship is preserved, however, when $\mathfrak{A}, \mathfrak{B}$ themselves are replaced
by equivalent polygons $\mathfrak{A}', \mathfrak{B}'$. If we hold $\mathfrak{B}$ fixed, and if $\frac{\mathfrak{A}}{\mathfrak{B}}$ is a differential, then

$$\frac{\mathfrak{A}'}{\mathfrak{B}}, \quad \frac{\mathfrak{A}''}{\mathfrak{B}}, \quad \ldots$$

represent differentials if and only if the polygons $\mathfrak{A}, \mathfrak{A}', \mathfrak{A}'', \ldots$ all belong to the
same class $A$. If the polygons $\mathfrak{A}_1, \mathfrak{A}_2, \mathfrak{A}_3, \ldots$ form a basis of $A$, so that

$$A = (\mathfrak{A}_1, \mathfrak{A}_2, \mathfrak{A}_3, \ldots),$$

then the corresponding differential quotients $\frac{d\tilde\omega_1}{dz}, \frac{d\tilde\omega_2}{dz}, \frac{d\tilde\omega_3}{dz}, \ldots$ with respect to any
variable $z$ form a basis for a finite-dimensional vector space of functions, and we
likewise say that $d\tilde\omega_1, d\tilde\omega_2, d\tilde\omega_3, \ldots$ is a basis for a vector space of differentials
$(d\tilde\omega_1, d\tilde\omega_2, d\tilde\omega_3, \ldots)$.

This means that each differential $d\tilde\omega$ whose lower polygon is $\mathfrak{B}$ or a divisor of $\mathfrak{B}$
may be expressed in the form, with constant coefficients $c_1, c_2, c_3, \ldots$,

$$d\tilde\omega = c_1\,d\tilde\omega_1 + c_2\,d\tilde\omega_2 + c_3\,d\tilde\omega_3 + \cdots.$$

49 The quotient of any two proper or improper differentials $\frac{d\tilde\omega_1}{d\tilde\omega_2}$ always has meaning as a
definite function in $\Omega$. In what follows we confine ourselves to the consideration of quotients in
which at least the denominator is a proper differential.


§26. Differentials of the first kind 


SUMMARY AND COMMENTS 


Differentials are classified into three kinds, of which the first is 
treated here. This classification goes back to the classification of 
elliptic integrals into three kinds by Legendre in 1793. The differentials
of the first kind are those without poles. Those of the second
and third kind are defined in §31.

In this section, Dedekind and Weber prove Abel's theorem for 
differentials of the first kind. Differentials of the second and third 
kind are treated in §31. However, the version of Abel’s theorem here 
is hardly recognizable as such, since it does not even mention the 
genus p. It does have a consequence (the “fundamental theorem” ) 
involving p, but a theorem resembling Abel’s statement does not 
appear until §33, where Abel’s name does not even occur. 

As mentioned in §10, the proof of Abel’s theorem hinges on 
the concept of complementary basis introduced there. It is now 
joined by the related concept of supplementary polygon, defined to 
be polygons whose product is complete, that is, the upper polygon 
of a differential. Likewise, along with differentials of the first kind, 
there are polygons of the first kind. There are also polygons of the 
second kind. But in the case of polygons, this simply means “not 
of the first kind.” 

Recalling the concept of a polygon class from §18, Dedekind 
and Weber introduce the class W of complete polygons of the first 
kind and prove that it has dimension p. In §27 they name it the 
principal class. W happens to be none other than the special class 
singled out by Roch in his contribution to the Riemann-Roch theo- 
rem. It is known today as the canonical class. 


We first consider the simplest differentials in $\Omega$, namely those for which the
lower polygon is the null-gon $\mathfrak{O}$. Such differentials (whose existence is admittedly
not proved yet) are called differentials of the first kind. The upper polygon $\mathfrak{W}$ of
such a differential $dw$, whose order is $2p - 2$, is called the fundamental polygon of
$dw$ and it is said to be a complete polygon of the first kind, while each divisor of
such a polygon is simply called a polygon of the first kind. If $\mathfrak{W} = \mathfrak{A}\mathfrak{B}$ then $\mathfrak{A}, \mathfrak{B}$
are called supplementary polygons of each other. A polygon that is not a divisor
of a complete polygon of the first kind, in particular any polygon with more than
$2p - 2$ points, is called a polygon of the second kind.


1. By the above remarks, all complete polygons of the first kind form a polygon
class $W$, whose dimension remains to be determined. If this dimension turns out
to be $> 0$, then the existence of polygons of the first kind is proved at the same
time. But the latter dimension is the same as the dimension of the vector space of
differentials of the first kind, which is also the same as that of the vector space of
differential quotients of the first kind, for an arbitrary variable $z$, when we take the
differential quotients of the first kind with respect to $z$ to be the functions

$$u = \frac{dw}{dz}.$$

By §25, (2), such a function $u$ is expressible as

$$u = \frac{\mathfrak{U}^2\mathfrak{W}}{\mathfrak{Z}}$$

and one easily sees, by consideration of the order numbers at the different points,
that such a differential quotient of the first kind is completely defined by the
following two properties:

I. At each point $\mathfrak{P}$ where $z$ has a finite value $z_0$,

$$(u(z - z_0))_0 = 0.$$

II. At a point $\mathfrak{P}$ where $z$ is infinite,

$$(zu)_0 = 0.$$

If, as in §11, 4,

$$r = (z - c)(z - c_1)(z - c_2)\cdots$$

is the product of all the distinct linear factors of the discriminant $\Delta_z(\Omega)$ and $\mathfrak{r}$ is
the product of all the distinct prime ideals dividing $r$, then condition I is equivalent
to the fact that $ru$ is a function in $\mathfrak{r}$, or that $u$ is a function in the module $\mathfrak{e}$
complementary to $\mathfrak{o}$ (§11, 4(6)). Thus in order to obtain the totality of functions $u$
one has to look among the functions in $\mathfrak{e}$ for those that satisfy condition II.


2. For this purpose we take a normal basis $1, \lambda_2, \ldots, \lambda_n$ of $\mathfrak{o}$ (§22) and denote
its complementary basis by $\mu_1, \mu_2, \ldots, \mu_n$, so that each function satisfying condition
I, hence also each differential quotient of the first kind, is of the form

$$u = y_1\mu_1 + y_2\mu_2 + \cdots + y_n\mu_n, \tag{1}$$

where $y_1, y_2, \ldots, y_n$ are polynomial functions of $z$. But it follows from the basic
properties of the complementary basis that (§10, 3)

$$y_s = \operatorname{Tr}(u\lambda_s); \qquad \frac{y_s}{z^{r_s - 1}} = \operatorname{Tr}\left(uz\,\frac{\lambda_s}{z^{r_s}}\right).$$

Now since $\frac{\lambda_s}{z^{r_s}}$ is in $\mathfrak{o}'$, and hence finite for $z = \infty$, and since $uz$ vanishes at each
such point by II, it follows (§16, 5) that $\frac{y_s}{z^{r_s - 1}}$ must vanish for $z = \infty$, i.e., the
degree of the polynomial function $y_s$ cannot exceed $r_s - 2$.
Therefore, if $r_s < 2$, $y_s$ must vanish, and hence

$$y_1 = 0; \quad \operatorname{Tr}(u) = 0$$

under all circumstances (§22, 2) (Abel’s theorem for differentials of the first kind)
and, if $r_s \ge 2$:

$$y_s = c_0 + c_1 z + c_2 z^2 + \cdots + c_{r_s - 2}\,z^{r_s - 2}. \tag{2}$$

It remains to show that these conditions are also sufficient, i.e., that each function
of the form (1) in which $y_s$ has the form (2) satisfies condition II, or, in other
words, that when $r_s \ge 2$, $z^{r_s - 1}\mu_s$ vanishes at all points at which $z$ is infinite. This
follows immediately by consideration of the system $\mathfrak{o}'$ of integral algebraic functions
of $z' = \frac{1}{z}$, for which the functions

$$\lambda_1' = \frac{\lambda_1}{z^{r_1}}, \quad \lambda_2' = \frac{\lambda_2}{z^{r_2}}, \quad \ldots, \quad \lambda_n' = \frac{\lambda_n}{z^{r_n}}$$

form a normal basis by §22, 3. By §10, 5 the basis complementary to this is

$$\mu_1' = z^{r_1}\mu_1, \quad \mu_2' = z^{r_2}\mu_2, \quad \ldots, \quad \mu_n' = z^{r_n}\mu_n,$$

and since

$$z'\mu_s' = 0 \quad \text{for } z' = 0$$

(by property I, applied to $z'$, $\mu'$), it follows that

$$z^{r_s - 1}\mu_s = 0 \quad \text{for } z = \infty.$$

Q.E.D.

But since the functions $z^{h}\mu_s$ are linearly independent (by the rational independence
of the functions $\mu_s$), §24, (2) yields the fundamental theorem:

The vector space of differentials of the first kind is of dimension

$$(r_2 - 1) + (r_3 - 1) + \cdots + (r_n - 1) = p,$$

and hence $p$ is also the dimension of the class $W$ of complete polygons of the first
kind.

As a basis for the vector space of differential quotients of the first kind with
respect to $z$ one can choose the $p$ functions $z^{h}\mu_s$ ($h \le r_s - 2$), and the fundamental
polygons $\mathfrak{W}_1, \mathfrak{W}_2, \ldots, \mathfrak{W}_p$ for the corresponding differentials $dw$ form a basis of the
class $W$.
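
The count in the fundamental theorem can be sketched by listing the index pairs $(h, s)$ of the basis functions $z^{h}\mu_s$. The exponent list below is a hypothetical input of ours, and the function name is likewise ours.

```python
def first_kind_basis(exponents):
    """Index pairs (h, s) of the functions z**h * mu_s with 0 <= h <= r_s - 2.

    exponents = [r_1, ..., r_n] for a normal basis, with r_1 = 0;
    the index s is 1-based, as in the text.
    """
    return [
        (h, s)
        for s, r in enumerate(exponents, start=1)
        for h in range(r - 1)            # h = 0, ..., r - 2; empty when r < 2
    ]


exps = [0, 1, 3, 4]                      # hypothetical exponents
basis = first_kind_basis(exps)
p = sum(r - 1 for r in exps[1:])         # genus by §24, (2)
print(basis)                             # [(0, 3), (1, 3), (0, 4), (1, 4), (2, 4)]
print(len(basis), p)                     # 5 5
```

Basis members with $r_s < 2$ contribute nothing, which matches the statement that $y_s$ must vanish in that case.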


3. For the sake of a later application we now consider a special type of
differential quotient $u'$ of the first kind, namely that in which condition II is replaced by
the following condition:

III. At each point $\mathfrak{P}$ where $z$ is infinite,

$$(z^{k}u')_0 = 0,$$

where $k$ is a given positive integer.

The functions $u'$ may be expressed as

$$u' = \frac{\mathfrak{U}^{k+1}\mathfrak{W}'}{\mathfrak{Z}}$$

and they likewise form a vector space. The polygons $\mathfrak{W}'$ also form a class $W'$,
whose order is

$$w - n(k + 1) = 2p - 2 - n(k - 1).$$
However, the polygons $\mathfrak{W}'$ are not independent of the choice of variable $z$. The
dimension of the class $W'$ may be determined by the same method as for the class
$W$. Since condition I is satisfied, the functions $u'$ are likewise expressible in the
form (1). However,

$$\frac{y_s}{z^{r_s - k}} = \operatorname{Tr}\left(u'z^{k}\,\frac{\lambda_s}{z^{r_s}}\right)$$

must now vanish for $z = \infty$, and therefore the degree of the polynomial $y_s$ cannot
exceed the number $r_s - k - 1$. Thus $y_s$ vanishes identically as soon as $r_s < k + 1$.
Otherwise

$$y_s = c_0 + c_1 z + \cdots + c_{r_s - k - 1}\,z^{r_s - k - 1}. \tag{3}$$

Conversely, if $y_s$ has this form, then condition III is satisfied by the function

$$u' = \sum_s y_s\mu_s,$$

because

$$z^{r_s - 1}\mu_s = z'\mu_s'$$

has the value 0 for $z = \infty$, as proved in 2.

It follows that the dimension of the vector space of functions $u'$, and hence also
that of the class $W'$, is

$$\sum_s (r_s - k),$$

where the sum is taken only over terms that have a positive value. If all $r_s - k \le 0$
the desired functions do not exist at all.
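
The dimension formula for $W'$ is a sum over positive terms only, which the following sketch of ours spells out. The exponent list is hypothetical, chosen for illustration, and the function name is ours.

```python
def dim_W_prime(exponents, k):
    """Sum of the positive values of r_s - k over the exponents of a normal basis."""
    return sum(r - k for r in exponents if r - k > 0)


exps = [0, 1, 3, 4]                # hypothetical exponents, with r_1 = 0
for k in (1, 2, 3, 4, 5):
    print(k, dim_W_prime(exps, k))
```

For $k = 1$ condition III is condition II, and the value returned is the genus $p = (r_2 - 1) + \cdots + (r_n - 1)$; once $k$ reaches the largest exponent the dimension is zero.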




§27. Polygon classes of the first and second kind 


SUMMARY AND COMMENTS 


In this section, the concepts of “supplementary,” “first kind,” 
and “second kind” in §26 are extended from polygons to their equiv- 
alence classes. In particular, classes A and B are called supplemen- 
tary if AB = W, where W is the canonical class introduced in 
§26 and called (by Dedekind and Weber) the principal class. The 
dimensions of supplementary classes are also investigated, with an 
eye to the proof of the Riemann-Roch theorem in the next section. 


If $\mathfrak{A}$ is a polygon of the first kind then all polygons equivalent to $\mathfrak{A}$ are likewise
of the first kind. Because when $\mathfrak{A}$ and $\mathfrak{B}$ are supplementary polygons and

$$\mathfrak{A}\mathfrak{B} = \mathfrak{W}$$

then

$$AB = W,$$

where $A, B$ are the classes of $\mathfrak{A}$ and $\mathfrak{B}$, and when $\mathfrak{A}'$ is equivalent to $\mathfrak{A}$, $\mathfrak{A}'\mathfrak{B} = \mathfrak{W}'$
is also equivalent to $\mathfrak{W}$ (§18, 5).

We therefore call such classes, which contain polygons of the first kind, polygon
classes of the first kind. The rest are called polygon classes of the second kind. The
class $W$ of complete polygons of the first kind is called the principal class, and two
classes $A, B$ that satisfy the condition

$$AB = W$$

are called supplementary classes.

If

$$\eta = \frac{\mathfrak{A}'}{\mathfrak{A}}$$

is a function in $\Omega$ and $\mathfrak{A}'$ is relatively prime to $\mathfrak{A}$, so that the class $A$ of $\mathfrak{A}$ is proper,
we call $\eta$ a function of the first or second kind according as the class $A$ is of the
first or second kind.

If $A$ is an arbitrary class of the first kind and $q$ is the number of independent
polygons $\mathfrak{W}$ divisible by a polygon $\mathfrak{A}$ in the class $A$ then, by §21, 2,

$$q = (A, W) = (O, B),$$

i.e., it equals the dimension of the supplementary class $B$ of $A$. Similarly, $(B, W)$
equals the dimension of the class $A$. If $A$ is a class of the second kind then $(A, W) = 0$.
Since $p$ is the dimension of $W$, then by §20, 2, 3 each class of order $\le p - 1$
is of the first kind, and in particular there are classes $A$ of order $p - k$ such that
$(A, W) = (O, B) = k$. It follows from the same theorems that there are classes of
order $p$ that are of the second kind.

§28. The Riemann-Roch theorem for proper classes 


SUMMARY AND COMMENTS 


In this section, the most substantial part of the proof of the 
Riemann-Roch theorem—for proper classes of the first kind—is car- 
ried out. Dedekind and Weber first express the theorem in terms 
of supplementary classes, then (in subsection 5) in terms of the 
canonical class W, as is more usual today. They also prove that W 
is always proper. 


The Riemann-Roch theorem, which in its usual formulation gives the number of 
arbitrary constants in a function that is infinite at a certain number of given points, 
in our presentation expresses a relation between the dimension and the order of a 
class, or between a class and its supplementary class. Confining ourselves initially 
to proper classes, we preface the derivation of this fundamental relation with the 
following remarks. 


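1. In a proper class $A$ it is always possible, by §19, 2, to choose two relatively
prime polygons $\mathfrak{A}$, $\mathfrak{A}'$ (one of which can be an arbitrary member of the class). Thus
if one sets
\[ z = \frac{\mathfrak{A}'}{\mathfrak{A}} \]
and if $\mathfrak{A}''$ is an arbitrary third polygon in the class $A$, then
\[ \omega = \frac{\mathfrak{A}''}{\mathfrak{A}}, \qquad \frac{\omega}{z} = \frac{\mathfrak{A}''}{\mathfrak{A}'}. \]
Hence, by §17, $\omega$ is an integral algebraic function of $z$ and $\omega/z$ is an integral
algebraic function of $1/z$. It then follows from §22 that the exponent of $\omega$ is $\le 1$.

Conversely, if $\omega$ is an integral algebraic function of $z$ with exponent $\le 1$ then
it has the form
\[ \omega = \frac{\mathfrak{A}''}{\mathfrak{A}}, \]
where $\mathfrak{A}''$ is a polygon in the class $A$. Because if
\[ \omega = \frac{\mathfrak{A}''}{\mathfrak{A}_1}, \qquad \frac{\omega}{z} = \frac{\mathfrak{A}''\mathfrak{A}}{\mathfrak{A}_1\mathfrak{A}'} \]
and $\mathfrak{A}''$ is taken relatively prime to $\mathfrak{A}_1$ then, since $\omega$ is an integral algebraic
function of $z$, it follows first that $\mathfrak{A}_1$ cannot contain points that are not also in
$\mathfrak{A}$. But $\mathfrak{A}_1$ also cannot contain points more often than $\mathfrak{A}$ does, otherwise $\omega/z$
would be infinite at such a point (which cannot appear in $\mathfrak{A}'$), and hence would not
be an integral algebraic function of $1/z$. Therefore $\mathfrak{A}$ is divisible by $\mathfrak{A}_1$, and $\omega$
can be put in the form $\mathfrak{A}''/\mathfrak{A}$.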


2. Thus in order to obtain the totality of polygons in the class $A$ it suffices to
seek the integral algebraic functions of $z$ with exponent $\le 1$.

If $n$ is the order of the class $A$, and hence also the order of the variable $z$, and
if $\lambda_1, \lambda_2, \ldots, \lambda_n$ form a normal basis of $\mathfrak{o}$ with exponents $r_1, r_2, \ldots, r_n$,
amongst which $r_s$ is the last $\le 1$, then each function $\omega$ with exponent $\le 1$ can be
expressed in the form
\[ \omega = c_1\lambda_1 + c_2\lambda_2 + \cdots + c_s\lambda_s + z x_1, \]
by §22, 2. But since the exponent of $z x_1$ cannot be larger than 1, $x_1$ must be a
constant and therefore
\[ \omega = c_1\lambda_1 + c_2\lambda_2 + \cdots + c_s\lambda_s + c_{s+1} z. \]
Conversely, each function of this form satisfies the stated condition. Thus $s + 1$ is
the dimension of the class $A$, which therefore, in agreement with §21, 1, is always
$\le n + 1$. The upper bound $n + 1$ can only be attained in the case $p = 0$, but it
is actually attained, because in this case $r_2, r_3, \ldots, r_n = 1$. It follows that a
single point $\mathfrak{P}$ can belong to a proper class when $p = 0$, and only then.


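3. If one of the exponents $r_{s+1}, r_{s+2}, \ldots, r_n$ is greater than 2 then certainly
$r_n > 2$ also. Hence, by §26, 2, the following are differential quotients of the first
kind with respect to $z$:
\[ \mu_n = \frac{\mathfrak{A}^2\mathfrak{W}}{\mathfrak{Z}}, \qquad z\mu_n = \frac{\mathfrak{A}^2\mathfrak{W}_1}{\mathfrak{Z}} = \frac{\mathfrak{A}\mathfrak{A}'\mathfrak{W}}{\mathfrak{Z}}, \]
where $\mathfrak{Z}$ is the ramification polygon for $z$. Thus
\[ \mathfrak{A}\mathfrak{W}_1 = \mathfrak{A}'\mathfrak{W} \]
or, since $\mathfrak{A}$, $\mathfrak{A}'$ are relatively prime,
\[ \mathfrak{W} = \mathfrak{A}\mathfrak{B}, \qquad \mathfrak{W}_1 = \mathfrak{A}'\mathfrak{B}. \]
I.e., the class $A$ is of the first kind ($z$ is a variable of the first kind). Thus if we
first assume that $A$ is a class of the second kind then it follows that
\[ r_{s+1} = r_{s+2} = \cdots = r_n = 2 \]
and
\[ p = (r_2 - 1) + \cdots + (r_s - 1) + (r_{s+1} - 1) + \cdots + (r_n - 1) = n - s. \]
The dimension $s + 1$ of the class $A$ is therefore
\[ (O, A) = n - p + 1. \]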
4. If we next assume that $A$ is of the first kind and
\[ q = (A, W) \]
as in §27, then there exist $q$ linearly independent complete polygons of the first
kind divisible by $\mathfrak{A}$, and the corresponding differential quotients of the first kind
with respect to $z$, which are also $q$ in number and likewise linearly independent,
have the form
\[ v = \frac{\mathfrak{A}^3\mathfrak{B}}{\mathfrak{Z}}, \]
where $\mathfrak{B}$ is a polygon of $2p - 2 - n$ points. The class $B$ of $\mathfrak{B}$ is the supplementary
class of $A$, and therefore its dimension equals $q$ (§27).

These functions $v$ have the property, however, that at the vertices of $\mathfrak{A}$, i.e.,
where $z = \infty$, not only does $zv$ vanish but so too does
\[ z^2 v = \frac{\mathfrak{A}\mathfrak{A}'^2\mathfrak{B}}{\mathfrak{Z}}. \]
Among the differential quotients of the first kind, they are completely characterized
by this property. Because if
\[ u = \frac{\mathfrak{A}^2\mathfrak{W}}{\mathfrak{Z}}, \qquad z^2 u = \frac{\mathfrak{A}'^2\mathfrak{W}}{\mathfrak{Z}}, \]
then if $z^2 u$ is to vanish at all points of $\mathfrak{A}$, $\mathfrak{W}$ must be divisible by $\mathfrak{A}$, since $\mathfrak{A}'$
is assumed to be relatively prime to $\mathfrak{A}$. Hence by §26, 3
\[ q = (r_{s+1} - 2) + (r_{s+2} - 2) + \cdots + (r_n - 2); \]
moreover
\[ p = (r_{s+1} - 1) + (r_{s+2} - 1) + \cdots + (r_n - 1), \]
hence:
\[ p - q = n - s, \qquad s = n - p + q. \]
This contains the Riemann-Roch theorem, which we can express as follows in the
present case, in view of §27: if $A$, $B$ are supplementary classes of the first kind, at
least one of which is proper, and if $a$, $b$ are their orders, so that
\[ a + b = 2p - 2, \]
then
\[ (O, A) - \tfrac{1}{2}a = (O, B) - \tfrac{1}{2}b. \]
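
The passage from $s = n - p + q$ to the symmetric statement is left to the reader in the text. A short sketch of the algebra follows, assuming only the relations stated above: $A$ has order $a = n$, its dimension is $(O, A) = s + 1$, $q = (O, B)$, and $a + b = 2p - 2$.

```latex
\begin{align*}
(O,A) &= s + 1 = a - p + 1 + q = a - p + 1 + (O,B), \\
(O,A) - (O,B) &= a - (p - 1) = a - \tfrac{1}{2}(a + b) = \tfrac{1}{2}(a - b), \\
(O,A) - \tfrac{1}{2}a &= (O,B) - \tfrac{1}{2}b .
\end{align*}
```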


5. When the case $(A, W) = 0$ is not excluded, the Riemann-Roch theorem
comprehending both cases is:
If $A$ is a proper class of order $n$ then its dimension is
\[ (O, A) = n - p + 1 + (A, W). \]
Since the dimension of a proper class must be at least 2 (when it does not consist
of a single null-gon), it follows that, when $(A, W) = 0$,
\[ n \ge p + 1, \]
whence we have the theorem due to Riemann:
Each function of order $\le p$ is a function of the first kind.


6. With the help of this theorem it is easy to prove that the principal class $W$
of complete polygons of the first kind is always proper.

Namely, if $\mathfrak{M}$ is the divisor of $W$ then by §19, 2 it is possible to find a polygon
$\mathfrak{A}\mathfrak{M}$ in $W$ such that $\mathfrak{A}$ is relatively prime to $\mathfrak{M}$. The class $A$ of $\mathfrak{A}$ is proper (§21,
3) and at the same time $\mathfrak{A}\mathfrak{M}$ is the only polygon in $W$ divisible by $\mathfrak{A}$ (since each
polygon in $W$ has the divisor $\mathfrak{M}$). Thus
\[ (A, W) = 1. \]
Now $p$ is the dimension of $W$, and hence also that of $A$. So, by the Riemann-Roch
theorem, the order of $A$ equals $2p - 2$, i.e., it is as large as that of $W$. Consequently
\[ \mathfrak{M} = \mathfrak{O}. \]
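
The arithmetic behind the last step can be spelled out. The sketch below assumes only what the argument has just established, namely that the proper class $A$ has dimension $p$ and $(A, W) = 1$; $n$ denotes the order of $A$, as in 5.

```latex
\[
  p = (O, A) = n - p + 1 + (A, W) = n - p + 2
  \quad\Longrightarrow\quad n = 2p - 2 .
\]
```

Since the text states that $W$ also has order $2p - 2$, the divisor of $W$ has order $0$, which is the assertion that it is the null-gon.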


§29. The Riemann-Roch theorem for improper classes of the first kind 


SUMMARY AND COMMENTS 


The proof of the Riemann-Roch theorem is now extended to 
improper classes of the first kind, by reducing it to the case of 
proper classes. 


If $A$ is a class of the first kind with divisor $\mathfrak{M}$, and if
\[ A = MA', \]
then $A'$ is a proper class of the first kind. Let $B$ be the supplementary class of
$A$; $B'$ that of $A'$; $a$, $b$ the orders of $A$, $B$; and $m$ the order of $\mathfrak{M}$. One obtains the
whole class $B$ when one cancels the factor $\mathfrak{M}$ in all the polygons in $B'$ divisible by
$\mathfrak{M}$. Because if
\[ \mathfrak{A}\mathfrak{B} = \mathfrak{A}'\mathfrak{M}\mathfrak{B} = \mathfrak{W}, \]
then $\mathfrak{M}\mathfrak{B}$ belongs to the class $B'$ and conversely, if
\[ \mathfrak{A}'\mathfrak{B}' = \mathfrak{A}'\mathfrak{M}\mathfrak{B} = \mathfrak{W} \]
then $\mathfrak{B}$ belongs to the class $B$.
But by §21, 2 this implies
\[ (O, B) \ge (O, B') - m. \]
Now $A'$ is a proper class of the same dimension as $A$ and of order $a - m$, hence
(§28, 5)
\[ (O, A) = (O, A') = a - m - p + 1 + (A', W), \]
or
\[ (O, A) = (O, B') - m + a - p + 1. \]
Therefore
\[ (O, A) \le (O, B) + a - p + 1 = (O, B) + \tfrac{1}{2}(a - b), \]
hence
\[ (O, A) - \tfrac{1}{2}a \le (O, B) - \tfrac{1}{2}b. \]
But since the classes $A$, $B$ can be interchanged, we similarly get
\[ (O, B) - \tfrac{1}{2}b \le (O, A) - \tfrac{1}{2}a, \]
i.e.,
\[ (O, A) - \tfrac{1}{2}a = (O, B) - \tfrac{1}{2}b, \]
so we have proved the Riemann-Roch theorem in the same form as in §28, 4 for
general polygon classes of the first kind.⁵⁰


§30. Improper classes of the second kind 


SUMMARY AND COMMENTS 


The proof of the Riemann-Roch theorem is now completed, by 
studying subcases of the remaining case. 


We now seek the conditions under which a polygon class A of the second kind 
and order n can actually be improper, which will yield the general validity of the 
Riemann-Roch theorem. 


1. Each class $A$ can always be converted to a proper class $AN$ by multiplication
by another class $N$ of order $\nu$. Because if $\mathfrak{A}$ is an arbitrary polygon in $A$ one chooses
a variable $z$ that remains finite at all the points of $\mathfrak{A}$ (§15, 6). Then, if $\eta$ is an
arbitrary function in the ideal with respect to $z$ generated by $\mathfrak{A}$, the upper polygon
of $\eta$ is divisible by $\mathfrak{A}$, hence of the form $\mathfrak{A}\mathfrak{N}$, and the class of $\mathfrak{A}\mathfrak{N}$ is proper.


2. The dimension of the proper class $AN$ of the second kind is
\[ (O, AN) = n + \nu - p + 1 \]
by §28, 3, and then it follows by §21, 2 that
\[ (O, A) \ge n - p + 1. \]
Now if the divisor $\mathfrak{M}$ of the class $A$ is of order $m$, and if
\[ A = MA', \]
then $A'$ is a proper class of the same dimension as $A$, and hence (§28, 5)
\[ (O, A) = (O, A') = n - m - p + 1 + (A', W), \]
so
\[ (A', W) \ge m, \]
i.e., $A'$ must certainly be of the first kind when $A$ is an improper class. Thus if $B'$
is the supplementary class of $A'$ we also have
\[ (O, B') \ge m. \]
But if $(O, B') > m$ it is possible, by §20, 2, to find a polygon $\mathfrak{M}\mathfrak{B}$ in $B'$ divisible
by $\mathfrak{M}$, and
\[ \mathfrak{A}'\mathfrak{M}\mathfrak{B} = \mathfrak{A}\mathfrak{B} = \mathfrak{W}, \]
so $A$ is of the first kind, contrary to hypothesis. Thus
\[ (A', W) = m \]
and hence
\[ (O, A) = n - p + 1, \]
so in this case we again get the Riemann-Roch theorem, precisely in the form of
§28, 3.

⁵⁰ In the terminology of Christoffel (On the canonical form of Riemann’s integral of the first
kind, Annali di Mathematica pura et applicata, Serie II, Tomo IX)
\[ (A, W) + a - p = (O, B) + a - p = (O, A) - 1 \]
is the “excess” and
\[ (A, W) - 1 = (O, B) - 1 \]
is the “defect” of the point system $\mathfrak{A}$.⁵¹

⁵¹ Here Dedekind and Weber use Christoffel’s term “point system” for what they elsewhere
call a “polygon.” (Translator’s note.)


3. If the class $A$ contains only a single isolated polygon, then $(O, A) = n - p + 1$,
hence $n = p$. I.e., an isolated polygon of the second kind always has order $p$.
Conversely, it follows from 2 that each polygon of the second kind and order $p$ is
isolated.


4. Retaining the notation of 2, $(O, B') = m$, and hence by the frequently used
theorem (§20, 2) there is a polygon in $B'$ divisible by any $(m - 1)$-gon. Thus if one
sets
\[ \mathfrak{M} = \mathfrak{P}\mathfrak{M}', \]
detaching an arbitrary point $\mathfrak{P}$ from $\mathfrak{M}$, then there is a polygon $\mathfrak{M}'\mathfrak{B}$ in $B'$ and
hence
\[ \mathfrak{A}'\mathfrak{M}'\mathfrak{B} = \mathfrak{W}. \]
The polygon $\mathfrak{A}'\mathfrak{M}' = \mathfrak{A}''$ and its class $A''$ are therefore of the first kind and $A$ has
the form
\[ A = PA'', \]
where $P$ denotes the class of $\mathfrak{P}$. At the same time we must have $(A'', W) =
(O, B'') = 1$, i.e., the supplementary class $B''$ of $A''$ contains only a single isolated
polygon $\mathfrak{B}''$, since otherwise there would be a polygon in $B''$ divisible by $\mathfrak{P}$, and
hence $A$ would also be of the first kind, contrary to hypothesis.


5. Conversely, if $A''$ is a class of the first kind for which $(A'', W) = 1$, so that
the supplementary class $B''$ of $A''$ consists of just an isolated polygon $\mathfrak{B}''$, and if
$\mathfrak{P}$ is a point not appearing in $\mathfrak{B}''$, and its class is $P$, then $A = PA''$ is an improper
class of the second kind and order $n$, divisible by $\mathfrak{P}$.

The fact that $A$ is of the second kind comes from the assumption that $\mathfrak{P}$ does
not divide $\mathfrak{B}''$. The dimension of $A$ is therefore
\[ (O, A) = n - p + 1 \]
by 2, where $n$ is the order of $A$. On the other hand, the dimension of the class $A''$
is
\[ (O, A'') = n - p + (A'', W) = n - p + 1 \]
by §§28 and 29, thus $A$ and $A''$ are of the same dimension. But all the polygons in
the class $A''$ become polygons in the class $A$ under multiplication by $\mathfrak{P}$, and because
of the equality of dimensions this completely exhausts the latter class. Thus all the
polygons in the class $A$ contain the factor $\mathfrak{P}$, which is therefore also in the divisor
of $A$.


6. In the special case where the genus $p$ of the field $\Omega$ is 0, polygons and classes
of the first kind do not occur at all. Thus in this case there are also no improper
classes. The dimension of each class is 1 greater than its order. In particular, each
point $\mathfrak{P}$ belongs to a proper class of dimension 2, and hence in this case there are
functions $z$ in $\Omega$ of first order. In terms of one such function any other function in
the field is expressible rationally, because the irreducible equation relating $z$ and
the other variable is of first degree in the latter (§15, 7).
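
A familiar illustration of the genus 0 case may help here. It is offered only as a sketch and is not part of Dedekind and Weber's text: it assumes that the field is the field of rational functions of a single variable $z$, that $\mathfrak{P}$ is the point where $z = \infty$, and that $P^n$ denotes the class of the polygon $\mathfrak{P}^n$. The functions that become infinite only at $\mathfrak{P}$, and there to order at most $n$, are then the polynomials in $z$ of degree at most $n$, which have $n + 1$ free coefficients.

```latex
\[
  \{\, c_0 + c_1 z + \cdots + c_n z^n \,\}
  \quad\Longrightarrow\quad
  (O, P^n) = n + 1 = n - p + 1 + (P^n, W),
  \qquad p = 0,\ (P^n, W) = 0 .
\]
```

This agrees with the statement that the dimension of each class is 1 greater than its order.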


§31. Differentials of the second and third kinds 


SUMMARY AND COMMENTS 


The different kinds of differentials, the first of which was in- 
troduced in §26, are revisited in the light of the Riemann-Roch 
theorem. One now sees where differentials of the second and third 
kinds fit in. 


1. Using the notation introduced in §25, suppose
\[ dw = \frac{\mathfrak{A}}{\mathfrak{B}} \]
is an arbitrary differential in $\Omega$. Thus
\[ a = b + 2p - 2, \]
where $a$, $b$ are the orders of $\mathfrak{A}$, $\mathfrak{B}$, and if $\mathfrak{A}$, $\mathfrak{B}$ are relatively prime and $\mathfrak{U}$, $\mathfrak{Z}$
denote the lower polygon and ramification polygon for an arbitrary variable $z$, then
$\mathfrak{U}^2\mathfrak{A}$ must be equivalent to $\mathfrak{Z}\mathfrak{B}$ (§25). Thus if $U$, $Z$, $A$, $B$ denote the classes of the
polygons $\mathfrak{U}$, $\mathfrak{Z}$, $\mathfrak{A}$, $\mathfrak{B}$ we have
\[ U^2 A = ZB. \]
On the other hand, when $W$ is the principal class of the first kind,
\[ U^2 W = Z, \]
which yields the relation
\[ A = BW. \]
Conversely, if $\mathfrak{A}$ is an arbitrary polygon in the class $BW$ then this relation implies
the equivalence of $\mathfrak{U}^2\mathfrak{A}$ with $\mathfrak{Z}\mathfrak{B}$, and hence the existence of a differential denoted
by $\mathfrak{A}/\mathfrak{B}$. It follows that $\mathfrak{B}$ is the lower polygon of a differential $dw$ if and only if
$BW$ contains a polygon relatively prime to $\mathfrak{B}$, i.e., when the divisor of the class $BW$
is relatively prime to $\mathfrak{B}$. The dimension of the class $BW$ is then also the dimension
of the vector space of differentials $dw$ associated with the lower polygon $\mathfrak{B}$ (§25).
Since $(W, W) = 1$, the theorems §30, 4, 5 then yield the following result.

a) If $\mathfrak{B}$ consists of a single point (i.e., if $b = 1$) then the class $BW$ is improper
with divisor $\mathfrak{B}$. Thus the order $b$ of the lower polygon of a differential $dw$ cannot
be one.

b) If $b \ge 2$ then $BW$ is always a proper class of the second kind and hence its
dimension is
\[ b + p - 1. \]
The lower polygon of a differential can therefore be any polygon of more than one
point, and $b + p - 1$ linearly independent differentials exist among those associated
with a lower polygon of order $b$.
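
The dimension in b) can be checked directly from the Riemann-Roch formula of §28, 5. A minimal sketch, assuming (as §28, 6 shows) that the principal class $W$ has order $2p - 2$, and that $BW$ is proper and of the second kind, so that $(BW, W) = 0$:

```latex
\begin{align*}
\text{order of } BW &= b + (2p - 2), \\
(O, BW) &= \bigl(b + 2p - 2\bigr) - p + 1 + (BW, W) = b + p - 1 .
\end{align*}
```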


2. Under the hypothesis that $b \ge 2$, we now seek a basis for the class $A$ such
that each element $\mathfrak{A}_\iota$ of this basis yields a differential $dw_\iota$ of the simplest possible
nature, namely, one whose lower polygon is a power of a single point or the product
of only two different points.

Supposing that such a basis
\[ \mathfrak{A}_1, \mathfrak{A}_2, \mathfrak{A}_3, \ldots, \mathfrak{A}_{b+p-1} \tag{1} \]
has already been found for the class $BW$, we build a similar basis of dimension
$b + p$ for the class $BPW$, namely
\[ \mathfrak{P}\mathfrak{A}_1, \mathfrak{P}\mathfrak{A}_2, \ldots, \mathfrak{P}\mathfrak{A}_{b+p-1}, \mathfrak{A}', \tag{2} \]
where $P$ denotes the class of an arbitrary point $\mathfrak{P}$. The first $b + p - 1$ of these
polygons really belong to the class $BPW$ and are independent, because the polygons
(1) are. Also, the differentials constructed from them,
\[ dw_\iota = \frac{\mathfrak{P}\mathfrak{A}_\iota}{\mathfrak{P}\mathfrak{B}} = \frac{\mathfrak{A}_\iota}{\mathfrak{B}}, \]
are identical with those constructed from (1). Thus it remains only to construct
$\mathfrak{A}'$, for which we have to distinguish two cases.

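a) If $\mathfrak{P}$ divides $\mathfrak{B}$ and $\mathfrak{B} = \mathfrak{M}\mathfrak{P}^m$, where $\mathfrak{M}$ is not divisible by $\mathfrak{P}$, then $P^{m+1}W$
is a proper class (because $m + 1 \ge 2$, §30, 4) in which there is consequently a
polygon $\mathfrak{N}$ not divisible by $\mathfrak{P}$. If one now sets $\mathfrak{A}' = \mathfrak{M}\mathfrak{N}$ then $\mathfrak{A}'$ belongs to the
class $BPW$ and is not divisible by $\mathfrak{P}$. Hence it is also not in the vector space
$(\mathfrak{P}\mathfrak{A}_1, \mathfrak{P}\mathfrak{A}_2, \ldots, \mathfrak{P}\mathfrak{A}_{b+p-1})$ whose divisor is $\mathfrak{P}$. It follows that the polygons (2) are
independent of each other, and since there are $b + p$ of them, they form a basis for
the class $BPW$. The differential constructed from $\mathfrak{A}'$,
\[ dw' = \frac{\mathfrak{A}'}{\mathfrak{P}\mathfrak{B}} = \frac{\mathfrak{N}}{\mathfrak{P}^{m+1}}, \]
has the required form, since its lower polygon is a power of a single point.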

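b) If $\mathfrak{P}$ does not divide $\mathfrak{B}$ one chooses once and for all a point $\mathfrak{P}_1$ dividing $\mathfrak{B}$
and sets $\mathfrak{B} = \mathfrak{M}\mathfrak{P}_1$ (whether or not $\mathfrak{M}$ is divisible by $\mathfrak{P}_1$). If one then chooses a
polygon $\mathfrak{N}$ in the proper class $PP_1W$ not divisible by $\mathfrak{P}$ and $\mathfrak{P}_1$, then $\mathfrak{A}' = \mathfrak{M}\mathfrak{N}$
again belongs to the class $PBW$, and since $\mathfrak{A}'$ is not divisible by $\mathfrak{P}$ it follows as
above that the polygons (2) form a basis for $BPW$. At the same time
\[ dw' = \frac{\mathfrak{A}'}{\mathfrak{P}\mathfrak{B}} = \frac{\mathfrak{N}}{\mathfrak{P}\mathfrak{P}_1}, \]
hence it is of the required form.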

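It now only remains to describe the beginning of this operation. If $b = 0$, so
that $\mathfrak{B} = \mathfrak{O}$, then
\[ BW = W = (\mathfrak{W}_1, \mathfrak{W}_2, \ldots, \mathfrak{W}_p) \]
(the principal class of the first kind).

If $b = 2$ then one chooses, from the proper class $BW$, a polygon $\mathfrak{N}$ relatively
prime to $\mathfrak{B}$. Then
\[ BW = (\mathfrak{B}\mathfrak{W}_1, \mathfrak{B}\mathfrak{W}_2, \ldots, \mathfrak{B}\mathfrak{W}_p, \mathfrak{N}). \]
If one proceeds from this basis, in the way described above, to determine a basis
(1) corresponding to an arbitrarily given polygon
\[ \mathfrak{B} = \mathfrak{P}_1^{m_1}\mathfrak{P}_2^{m_2}\mathfrak{P}_3^{m_3}\cdots \]
and determines the two polygons $\mathfrak{A}'_\iota$, $\mathfrak{B}'_\iota$ by the condition
\[ dw_\iota = \frac{\mathfrak{A}_\iota}{\mathfrak{B}} = \frac{\mathfrak{A}'_\iota}{\mathfrak{B}'_\iota} \]
and so that they have no common divisor, then the polygons $\mathfrak{B}'_\iota$ that appear as
lower polygons of the differentials are the following:

a) $p$ of them are $\mathfrak{O}$, and the associated differentials $dw_\iota$ are differentials of the
first kind.

b) Appearing once each, the lower polygons $\mathfrak{P}_1^2, \mathfrak{P}_1^3, \ldots, \mathfrak{P}_1^{m_1}$ (when $m_1 \ge 2$);
$\mathfrak{P}_2^2, \mathfrak{P}_2^3, \ldots, \mathfrak{P}_2^{m_2}$ (when $m_2 \ge 2$); and so on.

The differentials $dw_\iota$ corresponding to the lower polygons $\mathfrak{P}^r$ will be denoted by
$dt(\mathfrak{P}^{r-1})$ when a sharper distinction is necessary, and will be called differentials of
the second kind.

c) Finally, each of the products $\mathfrak{P}_1\mathfrak{P}_2, \mathfrak{P}_1\mathfrak{P}_3, \ldots$ (with fixed $\mathfrak{P}_1$) appears once.
The associated differentials $dw_\iota$ will be denoted by $d\pi(\mathfrak{P}_1, \mathfrak{P}_2), \ldots$ and will be called
differentials of the third kind.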

Each differential $dw$ whose lower polygon is $\mathfrak{B}$ can be represented in the form
\[ dw = \sum_\iota c_\iota\, dw_\iota \tag{3} \]
with constant coefficients $c_\iota$, and this will be called the normal form of the
differential $dw$. Once the individual differentials $dw_\iota$ of a particular kind have been
chosen, then the normal form is unique, as follows immediately from the linear
independence of the differentials $dw_\iota$.
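
As a consistency check on the list a), b), c), the number of differentials it names should equal the dimension $b + p - 1$ found in 1. The count below is only a sketch; the symbol $k$, standing for the number of distinct points in the given lower polygon, is introduced here for the purpose and does not appear in the original.

```latex
\begin{align*}
\underbrace{p}_{\text{first kind}}
+ \underbrace{\sum_{i=1}^{k} (m_i - 1)}_{\text{second kind}}
+ \underbrace{(k - 1)}_{\text{third kind}}
&= p + (b - k) + (k - 1) = b + p - 1,
\qquad b = \sum_{i=1}^{k} m_i .
\end{align*}
```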


§32. Residues 


SUMMARY AND COMMENTS 


With their powerful algebraic machinery for differentials now 
in place, Dedekind and Weber go on to prove an algebraic version 
of the theorem well known from complex analysis: the sum of the 
residues of an algebraic differential is zero. They reduce it to the 
case of rational functions by forming the trace. 


1. If $dw$ is an arbitrary differential in $\Omega$ and if $\mathfrak{P}$ is a point that appears $m$-tuply
in the lower polygon $\mathfrak{B}$ of $dw$ ($m \ge 0$), then one chooses a variable $z$ that is $\infty^1$ at
$\mathfrak{P}$. It is then possible (by §15, 4), and indeed in only one way, to set
\[ \frac{dw}{dz} = a_{m-2}z^{m-2} + a_{m-3}z^{m-3} + \cdots + a_1 z + a_0 + a_{-1}z^{-1} + \eta z^{-2}, \tag{1} \]
where the $a$ are constants and $\eta$ is a function in $\Omega$ that is finite at $\mathfrak{P}$. The coefficient
$-a_{-1}$ of $-z^{-1}$ in this expression is called the residue of the differential $dw$ with
respect to the point $\mathfrak{P}$. This definition yields the following theorems:


2. The residue with respect to a point $\mathfrak{P}$ can be nonzero only if $m > 0$, i.e.,
when the point $\mathfrak{P}$ really appears in the lower polygon of $dw$, and hence it is always
0 for differentials of the first kind.


3. The residue of a sum of differentials equals the sum of the residues of the 
individual differentials. 


4. The residue of a proper differential is always equal to 0. Because if $\sigma$ is a
function in $\Omega$, and if the $b$ are constants and $\sigma'$ is a function that is finite at $\mathfrak{P}$ then
\[ \sigma = b_r z^r + b_{r-1}z^{r-1} + \cdots + b_1 z + \sigma', \]
and it follows by differentiation of this expression with respect to $z$, and the fact
that $d\sigma'/dz$ is infinitely small of at least second order at $\mathfrak{P}$ (§23, 10), that the $a_{-1}z^{-1}$
term does not appear in the expression for $d\sigma/dz$. This proves the assertion.


5. The residue of a differential $dw$ is independent of the choice of variable $z$. If
$z_1$ is a second variable of the same character as $z$, i.e.,
\[ z = a z_1 + \varepsilon, \tag{2} \]
where $a$ is constant and $\varepsilon$ is finite at $\mathfrak{P}$, then it follows, using the abbreviation
\[ Q = \frac{a_{m-2}z^{m-1}}{m-1} + \frac{a_{m-3}z^{m-2}}{m-2} + \cdots + \tfrac{1}{2}a_1 z^2 + a_0 z, \]
that
\[ \frac{dw}{dz_1} = \frac{dw}{dz}\frac{dz}{dz_1} = \frac{dQ}{dz_1} + a_{-1}z^{-1}\frac{dz}{dz_1} + \eta z^{-2}\frac{dz}{dz_1}. \]
Now it follows easily from §23 and §15, 4 that
\[ z^{-1}\frac{dz}{dz_1} = z_1^{-1} + \varepsilon' z_1^{-2}, \qquad z^{-2}\frac{dz}{dz_1} = \varepsilon'' z_1^{-2}, \]
where $\varepsilon'$, $\varepsilon''$ are functions that are finite at $\mathfrak{P}$, and the correctness of the assertion
then follows by 3, 4.⁵²


6. The sum of the residues of any differential $dw$ with respect to all points $\mathfrak{P}$
is always zero.

To prove this important theorem we can confine ourselves to considering the
residues belonging to the distinct points in the lower polygon $\mathfrak{B}$ of $dw$. However, we
add to these enough distinct arbitrary points with vanishing residues to obtain a
polygon $\mathfrak{P}_1\mathfrak{P}_2\cdots\mathfrak{P}_n$ consisting of simple points only and belonging to a proper
class. Then we choose a variable $z$ of order $n$ whose lower polygon is just this one,
and which is therefore $\infty^1$ at precisely the points $\mathfrak{P}_1, \mathfrak{P}_2, \ldots, \mathfrak{P}_n$. Among the latter
we then have all the distinct points in $\mathfrak{B}$. Under this hypothesis we have
\[ \frac{dw}{dz} = a^{(\iota)}_{m-2}z^{m-2} + a^{(\iota)}_{m-3}z^{m-3} + \cdots + a^{(\iota)}_{-1}z^{-1} + \eta^{(\iota)}z^{-2} \tag{3} \]
for $\iota = 1, 2, \ldots, n$, where $\eta^{(\iota)}$ denotes a function that is finite at $\mathfrak{P}_\iota$. If we allow
the constants $a$ to take the value 0 then the exponent $m$ can be taken to be
independent of $\iota$ (when not all $a^{(\iota)}_{m-2}$ vanish, $m$ is then the exponent of the highest
power of a point appearing in $\mathfrak{B}$). The theorem to be proved now says that
$\sum_\iota a^{(\iota)}_{-1} = 0$. In order to prove it, we construct the trace of the function $dw/dz$ for
the variable $z$ (§2), making use of an extension of the process of §16, 4. We choose
a system $\rho_1, \rho_2, \ldots, \rho_n$ of functions in $\Omega$ as follows: let

$\rho_1 = 0^m$ at $\mathfrak{P}_2, \mathfrak{P}_3, \ldots, \mathfrak{P}_n$, finite and nonzero at $\mathfrak{P}_1$,
$\rho_2 = 0^m$ at $\mathfrak{P}_1, \mathfrak{P}_3, \ldots, \mathfrak{P}_n$, finite and nonzero at $\mathfrak{P}_2$,
. . .
$\rho_n = 0^m$ at $\mathfrak{P}_1, \mathfrak{P}_2, \ldots, \mathfrak{P}_{n-1}$, finite and nonzero at $\mathfrak{P}_n$.

⁵² One can also base the definition of residue on a variable $r$ that is infinitely small of first
order at $\mathfrak{P}$. Then if
\[ \frac{dw}{dr} = a_m r^{-m} + \cdots + a_1 r^{-1} + \eta \]
and $\eta$ is finite at $\mathfrak{P}$, $a_1$ is the residue of $dw$ with respect to $\mathfrak{P}$.

Now if $x_1, x_2, \ldots, x_n$ are rational functions of $z$, and if
\[ \eta = x_1\rho_1 + x_2\rho_2 + \cdots + x_n\rho_n \]
is a function in $\Omega$ that is finite for $z = \infty$, i.e., at $\mathfrak{P}_1, \mathfrak{P}_2, \ldots, \mathfrak{P}_n$, then
$x_1, x_2, \ldots, x_n$ must likewise be finite for $z = \infty$. For if $x_1, x_2, \ldots, x_n$ were not all
finite for $z = \infty$ there would be a positive exponent $r$ such that the products
$x_1 z^{-r}, x_2 z^{-r}, \ldots, x_n z^{-r}$ were all finite for $z = \infty$, with at least one of them, say
$x_1 z^{-r}$, nonzero. But then the equation
\[ \eta z^{-r} = x_1 z^{-r}\rho_1 + \cdots + x_n z^{-r}\rho_n \]
contains the contradiction that at the point $\mathfrak{P}_1$ the left side and all terms but the
first on the right-hand side vanish.

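It follows immediately, by setting $\eta = 0$, that the functions $\rho_1, \rho_2, \ldots, \rho_n$ form
a basis of $\Omega$. Hence if one sets
\[ \frac{dw}{dz}\rho_\iota = x_{\iota,1}\rho_1 + x_{\iota,2}\rho_2 + \cdots + x_{\iota,n}\rho_n \qquad (\iota = 1, 2, \ldots, n), \tag{4} \]
where the $x_{\iota,\iota'}$ are rational functions of $z$, then by §2
\[ \operatorname{Tr}\left(\frac{dw}{dz}\right) = x_{1,1} + x_{2,2} + \cdots + x_{n,n}. \tag{5} \]
Now it follows from (3) that $z^{-m+2}\frac{dw}{dz}\rho_\iota$ is finite for $z = \infty$ and hence so too is
$z^{-m+2}x_{\iota,\iota'}$, by the property of the functions $\rho$ just proved. Now, e.g., the functions
$\rho_1, \rho_3, \ldots, \rho_n$ are infinitely small of $m$th order at the point $\mathfrak{P}_2$, while $\rho_2$ is finite
and nonzero there. Hence at $\mathfrak{P}_2$ the functions
\[ z\frac{dw}{dz}\rho_1,\quad z x_{1,1}\rho_1,\quad z x_{1,3}\rho_3,\quad \ldots,\quad z x_{1,n}\rho_n \]
all vanish, and it follows that $z x_{1,2}$ also vanishes for $z = \infty$. The same follows
for $z x_{1,3}, \ldots, z x_{1,n}$ and in general for $z x_{\iota,\iota'}$ as long as $\iota$, $\iota'$ are different. Therefore
$z^2 x_{\iota,\iota'}$ is finite for $z = \infty$.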

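If one now sets
\[ x_{\iota,\iota} = a^{(\iota)}_{m-2}z^{m-2} + a^{(\iota)}_{m-3}z^{m-3} + \cdots + a^{(\iota)}_{-1}z^{-1} + x_\iota z^{-2}, \tag{6} \]
where $x_\iota$ is a new rational function, then it follows from (3) that
\[ \frac{dw}{dz} - x_{\iota,\iota} = \left(\eta^{(\iota)} - x_\iota\right)z^{-2}, \]
and from (4) that
\[ \left(\eta^{(\iota)} - x_\iota\right)\rho_\iota = z^2 x_{\iota,1}\rho_1 + \cdots + z^2 x_{\iota,\iota-1}\rho_{\iota-1} + z^2 x_{\iota,\iota+1}\rho_{\iota+1} + \cdots + z^2 x_{\iota,n}\rho_n. \]
Now since $\eta^{(\iota)}$ is finite at $\mathfrak{P}_\iota$ and $\rho_\iota$ is nonzero, while all the terms on the right-
hand side are zero there, it follows that $x_\iota$ is also finite at the point $\mathfrak{P}_\iota$. Since $x_\iota$ is
rational, this means $x_\iota$ is finite for $z = \infty$. It then follows from (5) and (6) that
\[ \operatorname{Tr}\left(\frac{dw}{dz}\right) = z^{m-2}\sum_\iota a^{(\iota)}_{m-2} + z^{m-3}\sum_\iota a^{(\iota)}_{m-3} + \cdots + z^{-1}\sum_\iota a^{(\iota)}_{-1} + z^{-2}\sum_\iota x_\iota. \tag{7} \]
On the other hand, when $\mathfrak{U}$ is once again the lower polygon, and $\mathfrak{Z}$ is the ramification
polygon, of $z$, we have
\[ \frac{dw}{dz} = \frac{\mathfrak{U}^2\mathfrak{A}}{\mathfrak{Z}\mathfrak{B}}, \]
and $\mathfrak{B}$ contains no point that is not also in $\mathfrak{U}$. It then follows as in §23⁵³ that $dw/dz$,
regarded as a function of $z$, is a function in the module $\mathfrak{e}$ complementary to $\mathfrak{o}$, and
consequently
\[ \operatorname{Tr}\left(\frac{dw}{dz}\right) \]
is a polynomial function of $z$ (§11, 4). In view of this, it follows from (7) that
$\sum_\iota x_\iota = 0$, and also the theorem to be proved:
\[ \sum_\iota a^{(\iota)}_{-1} = 0. \]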


We can also express these theorems in the following way: the residue of a
differential of the second kind $dt(\mathfrak{P}^r)$ with respect to the point $\mathfrak{P}$ is zero.

The residues of a differential of the third kind $d\pi(\mathfrak{P}_1, \mathfrak{P}_2)$ with respect to $\mathfrak{P}_1$, $\mathfrak{P}_2$
are equal and opposite, and certainly nonzero, otherwise $d\pi$ would be a differential
of the first kind.

It follows from these remarks and 4 that a proper differential $d\sigma$ in normal form
cannot contain a differential of the third kind. It is also worth mentioning that the
residues of the logarithmic differential $d\sigma/\sigma$ are integers, namely the order numbers
of the function $\sigma$ (by virtue of §23).
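
A small worked example may make the last two remarks concrete. It is an illustration only, not taken from Dedekind and Weber: it assumes the field of rational functions of $z$, two distinct finite values $\alpha \ne \beta$ (hypothetical names), the function $\sigma = (z - \alpha)^2/(z - \beta)$, and the definition of residue by a variable $r$ that is infinitely small of first order at the point (here $r = z - \alpha$, $r = z - \beta$, and $r = 1/z$ at $z = \infty$).

```latex
\begin{align*}
\frac{d\sigma}{\sigma} &= \left( \frac{2}{z - \alpha} - \frac{1}{z - \beta} \right) dz, \\
\text{at } z = \alpha:&\quad \text{residue } 2 \quad (\sigma \text{ vanishes to order } 2), \\
\text{at } z = \beta:&\quad \text{residue } -1 \quad (\sigma \text{ is infinite of order } 1), \\
\text{at } z = \infty:&\quad z = r^{-1},\ dz = -r^{-2}\,dr,\quad
\frac{2}{z - \alpha} - \frac{1}{z - \beta} = r + (2\alpha - \beta) r^2 + \cdots, \\
&\quad \frac{d\sigma}{\sigma} = \bigl( -r^{-1} - (2\alpha - \beta) - \cdots \bigr)\,dr,
\quad \text{residue } -1 \quad (\sigma \text{ is infinite of order } 1), \\
\text{sum:}&\quad 2 - 1 - 1 = 0 .
\end{align*}
```

The residues are the order numbers of $\sigma$ at the three points, and they sum to zero, as theorem 6 requires.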


§33. Relations between differentials of the first and second kinds 


SUMMARY AND COMMENTS 


In this section, Dedekind and Weber finally prove a theorem 
that looks like Abel’s theorem, though of course with differentials 
in place of integrals. 


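1. Let $\sigma$ be a function in $\Omega$ with lower polygon
\[ \mathfrak{B}' = \mathfrak{P}_1^{m_1-1}\mathfrak{P}_2^{m_2-1}\cdots \qquad (m_1, m_2, \ldots \ge 2) \]
and ramification polygon (§16)
\[ \mathfrak{S} = \mathfrak{S}'\mathfrak{P}_1^{m_1-2}\mathfrak{P}_2^{m_2-2}\cdots, \]
where $\mathfrak{S}'$ is not divisible by the distinct points $\mathfrak{P}_1, \mathfrak{P}_2, \ldots$. Then in the symbolism
of §25 we have the proper differential
\[ d\sigma = \frac{\mathfrak{S}}{\mathfrak{B}'^2} = \frac{\mathfrak{S}'}{\mathfrak{P}_1^{m_1}\mathfrak{P}_2^{m_2}\cdots}, \]
which shows, in the first place, that a proper differential can never be of the first
kind.

⁵³ Dedekind and Weber say §26, but §23, 5 seems more relevant, since that is where it is
proved that the derivative of a function in $\mathfrak{o}$ is in $\mathfrak{e}$. (Translator’s note.)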

2. The proper differential $d\sigma$, which in its normal form representation can
contain only differentials of the first and second kind, belongs to the vector space
of those differentials whose lower polygon is
\[ \mathfrak{B} = \mathfrak{P}_1^{m_1}\mathfrak{P}_2^{m_2}\cdots = \mathfrak{B}'\mathfrak{P}_1\mathfrak{P}_2\cdots. \]
Conversely, one can always find at least one proper differential $d\sigma$ in such a vector
space, assuming that $m_1, m_2, \ldots \ge 2$ and that $\mathfrak{B}'$ belongs to a proper polygon class.
Because by 1 it is only necessary that a function $\sigma$ with lower polygon $\mathfrak{B}'$ exist in
$\Omega$.


3. This now yields the following important theorem. All differentials of the 
second kind may be expressed linearly with constant coefficients in terms of p suitably 
chosen differentials of the second kind, differentials of the first kind, and proper 
differentials. 


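In order to see this, one chooses an arbitrary polygon $\mathfrak{A}$ of the second kind and
order $p$. Now if $\mathfrak{P}$ is an arbitrary point and $r$ is a positive exponent then the polygon
$\mathfrak{A}\mathfrak{P}^r$ is likewise of the second kind, and hence the divisor $\mathfrak{M}$ of the associated class
cannot be divisible by $\mathfrak{P}$; otherwise $\mathfrak{A}\mathfrak{P}^{r-1}$, and hence also $\mathfrak{A}$, would be a polygon
of the first kind (§30, 4). Therefore, if one sets
\[ \mathfrak{A}\mathfrak{P}^r = \mathfrak{M}\mathfrak{B}', \]
then $\mathfrak{P}$ does not divide $\mathfrak{M}$, and hence $\mathfrak{B}'$ contains the factor $\mathfrak{P}$ exactly $r$ times
more often than does $\mathfrak{A}$. At the same time, $\mathfrak{B}'$ belongs to a proper class. Now, if
\[ \mathfrak{B}' = \mathfrak{P}^{m+r}\mathfrak{P}'^{m'}\mathfrak{P}''^{m''}\cdots \]
then the powers $\mathfrak{P}^m, \mathfrak{P}'^{m'}, \ldots$ of points all divide $\mathfrak{A}$. Thus if we set
\[ \mathfrak{B} = \mathfrak{P}^{m+r+1}\mathfrak{P}'^{m'+1}\mathfrak{P}''^{m''+1}\cdots = \mathfrak{B}'\mathfrak{P}\mathfrak{P}'\mathfrak{P}''\cdots \]
then, by 2, the vector space of differentials associated with the lower polygon $\mathfrak{B}$
certainly contains a proper differential $d\sigma$. The normal form representation of the
latter certainly contains the differential
\[ dt(\mathfrak{P}^{m+r}), \tag{1} \]
and also some or all of the differentials
\[ \begin{aligned}
& dt(\mathfrak{P}),\ dt(\mathfrak{P}^2),\ \ldots,\ dt(\mathfrak{P}^m),\ \ldots,\ dt(\mathfrak{P}^{m+r-1}), \\
& dt(\mathfrak{P}'),\ dt(\mathfrak{P}'^2),\ \ldots,\ dt(\mathfrak{P}'^{m'}), \\
& dt(\mathfrak{P}''),\ dt(\mathfrak{P}''^2),\ \ldots,\ dt(\mathfrak{P}''^{m''}),
\end{aligned} \tag{2} \]
in addition to differentials of the first kind. Thus the differential (1) is expressible
linearly with constant coefficients in terms of (2), differentials of the first kind, and
$d\sigma$.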


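Therefore, if the $p$-gon of the second kind is
\[ \mathfrak{A} = \mathfrak{P}_1^{m_1}\mathfrak{P}_2^{m_2}\cdots, \]
one sees by repeated application of the process just described that all differentials
of the second kind are expressible in terms of the $p$ differentials
\[ dt(\mathfrak{P}_1),\ \ldots,\ dt(\mathfrak{P}_1^{m_1}),\ dt(\mathfrak{P}_2),\ \ldots,\ dt(\mathfrak{P}_2^{m_2}),\ \ldots \tag{3} \]
in the manner described in our theorem.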


Braunschweig and Königsberg, October 1880.

This book is the first English translation of the classic long paper Theorie der
algebraischen Functionen einer Veränderlichen (Theory of algebraic functions
of one variable), published by Dedekind and Weber in 1882. The translation 
has been enriched by a Translator’s Introduction that includes historical 
background, and also by extensive commentary embedded in the translation 
itself. 


The translation, introduction, and commentary provide the first easy access 
to this important paper for a wide mathematical audience: students, histo- 
rians of mathematics, and professional mathematicians. 


Why is the Dedekind-Weber paper important? In the 1850s, Riemann initi- 
ated a revolution in algebraic geometry by interpreting algebraic curves as 
surfaces covering the sphere. He obtained deep and striking results in pure 
algebra by intuitive arguments about surfaces and their topology. However, 
Riemann’s arguments were not rigorous, and they remained in limbo until 
1882, when Dedekind and Weber put them on a sound foundation. 


The key to this breakthrough was to develop the theory of algebraic func- 
tions in analogy with Dedekind’s theory of algebraic numbers, where the 
concept of ideal plays a central role. By introducing such concepts into the 
theory of algebraic curves, Dedekind and Weber paved the way for modern 
algebraic geometry. 